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TRANSPORTERS AND ION CHANNELS 



TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of transporters and ion 
5 channels and to the use of these sequences in the diagnosis, treatment, and prevention of transport, 
neurological, muscle, immunological, and cell proliferative disorders, and in the assessment of the 
effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of 
transporters and ion channels. 

10 BACKGROUND OF THE INVENTION 

Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 
hydrophobic lipid bilayer membranes which are highly impermeable to most polar molecules. Cells 
and organelles require transport proteins to import and export essential nutrients and metal ions 
including K\ NH/, ?„ SOf, sugars, and vitamins, as well as various metabolic waste products. 

15 Transport proteins also play roles in antibiotic resistance, toxin secretion, ion balance, synaptic 
neurotransmission, kidney function, intestinal absorption, tumor growth, and other diverse cell 
functions (Griffith, J. and C. Sansom (1998) Th* Transporter Facts Book, Academic Press, San Diego 
CA, pp. 3-29). Transport can occur by a passive concentration-dependent mechanism, or can be 
linked to an energy source such as ATP hydrolysis or an ion gradient. Proteins that function in 

20 transport include carrier proteins, which bind to a specific solute and undergo a conformational 
change that translocates the bound solute across the membrane, and channel proteins, which form 
hydrophilic pores that allow specific solutes to diffuse through the membrane down an 
electrochemical solute gradient. 

Carrier proteins which transport a single solute from one side of the membrane to the other 

25 are called uniporters: In contrast, coupled transporters link the transfer of one solute with 

simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the 
opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety of 
symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium 
moves into the cell down its electrochemical gradient and brings the solute into the cell with it. The 

30 sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous 
NaVK + ATPase system. Sodium-coupled transporters include the mammalian glucose transporter 
(SGLT1), iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have 
twelve putative transmembrane segments, extracellular glycosylate sites, and cytoplasmically- 
oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis, and treatment of 

35 various thyroid pathologies because it is the molecular basis for radioiodide thyroid-imaging 
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techniques and for specific targeting of radioisotopes to the thyroid gland (Levy, O. et al. (1997) 
Proc. Nad. Acad. Sci. USA 94:5568-5573). SMVT is expressed in the intestinal mucosa, kidney, and 
placenta, and is implicated in the transport of the water-soluble vitamins, e.g., biotin and pantothenate 
(Prasad, P.D. et al. (1998) J. Biol. Chem. 273:7501-7506). 
5 One of the largest families of transporters is the major facilitator superfamily (MFS), also 

called the uniporter-symporter-antiporter family. MFS transporters are single polypeptide carriers 
that transport small solutes in response to ion gradients. Members of the MFS are found in all classes 
of living organisms, and include transporters for sugars, oligosaccharides, phosphates, nitrates, 
nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have a structure 
10 comprising 12 transmembrane segments (Pao, S.S. et al. (1998) Microbiol. Molec. Biol. Rev. 62:1- 
34). The largest family of MFS transporters is the sugar transporter family, which includes the seven 
glucose transporters (GLUT1-GLUT7) found in humans that are required for the transport of glucose 
and other hexose sugars. These glucose transport proteins have unique tissue distributions and 
physiological functions. GLUT1 provides many cell types with their basal glucose requirements and 
15 transports glucose across epithelial and endothelial barrier tissues; GLUT2 facilitates glucose uptake 
or efflux from the liver, GLUT3 regulates glucose supply to neurons; GLUT4 is responsible for 
insulin-regulated glucose disposal; and GLUT5 regulates fructose uptake into skeletal muscle. 
Defects in glucose transporters are involved in a recently identified neurological syndrome causing 
infantile seizures and developmental delay, as well as glycogen storage disease, Fanconi-Bickel 
20 syndrome, and non-insulin-dependent diabetes mellitus (Mueckler, M. (1994) Eur. J. Biochem. 
219:713-725; Longo, N. and L.J. Hsas (1998) Adv. Pediatr. 45:293-313). 

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate 
specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are 
25 predicted to have twelve transmembrane (TM) helical domains with a large intracellular loop between 
TM6 and TM7, and play a critical role in rnaintaining intracellular pH by removing the protons that 
are produced stoichiometrically with lactate during glycolysis. The best characterized 
H + -monocarboxylate transporter is that of the erythrocyte membrane, which transports L-lactate and a 
wide range of other aliphatic monocarboxylates. Other cells possess HMinked monocarboxylate 
30 transporters with differing substrate and inhibitor selectivities. In particular, cardiac muscle and 
tumor cells have transporters that differ in their K,, values for certain substrates, including 
stereoselectivity for L- over D-lactate, and in their sensitivity to inhibitors. There are 
Na + -monocarboxylate cotransporters on the luminal surface of intestinal and kidney epithelia, which 
allow the uptake of lactate, pyruvate, and ketone bodies in these tissues. In addition, there are 
35 specific and selective transporters for organic cations and organic anions in organs including the 
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kidney, intestine and liver. Organic anion transporters are selective for hydrophobic, charged 
molecules with electron-attracting side groups. Organic cation transporters, such as the ammonium 
transporter, mediate the secretion of a variety of drugs and endogenous metabolites, and contribute to 
the maintenance of intercellular pH (Poole, R.C. and A.P. Halestrap (1993) Am. J. Physiol. 

5 264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 329:321-328; and Martinelle, K. and L 
Haggstrom(1993) J. Biotechnol. 30:339-350). 

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane 
proteins that transport substances ranging from small molecules such as ions, sugars, amino acids, 
peptides, and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC 

10 transporters consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP 
to supply the energy required for transport, and two membrane-spanning domains (MSD), each 
containing six putative transmembrane segments. These four modules may be encoded by a single 
gene, as is the case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. 
When encoded by separate genes, each gene product contains a single NBD and MSD. These "half- 

15 molecules" form homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based 
major histocompatibility (MHC) peptide transport system Several genetic diseases are attributed to 
defects in ABC transporters, such as the following diseases and their corresponding proteins: cystic 
fibrosis (CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), 
Zellweger syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic 

20 hypoglycemia (sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) 
protein, another ABC transporter, in human cancer cells makes the cells resistant to a variety of 
cytotoxic drugs used in chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Enzymol. 
292:130-162). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, 
25 selenium, nickel, and chromium are important as cofactors for a number of enzymes. For example, 
copper is involved in hemoglobin synthesis, connective tissue metabolism, and bone development, by 
acting as a cofactor in oxidoreductases such as superoxide dismutase, ferroxidase (ceruloplasmin), 
and lysyl oxidase. Copper and other metal ions must be provided in the diet, and are absorbed by 
transporters in the gastrointestinal tract. Plasma proteins transport the metal ions to the liver and 
30 other target organs, where specific transporters move the ions into cells and cellular organelles as 
needed. Imbalances in metal ion metabolism have been associated with a number of disease states 
(Danks, D.M. (1986) J. Med. Genet. 23:99-106). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity, 
low affinity process. However, under normal physiological conditions a significant fraction of fatty 
35 acid transport appears to occur via a high affinity, low capacity protein-mediated transport process. 
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Fatty acid transport protein (FATP), an integral membrane protein with four transmembrane 
segments, is expressed in tissues exhibiting high levels of plasma membrane fatty acid flux, such as 
muscle, heart, and adipose. Expression of FATP is upregulated in 3T3-L1 cells during adipose 
conversion, and expression in COS7 fibroblasts elevates uptake of long-chain fatty acids (Hui, T.Y. et 

5 al. (1998) J. Biol. Chem. 273:27420-27429). 

Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions and 
charged metabolites between the cytosol and the mitochondrial matrix. Examples include the ADP, 
ATP carrier protein; the 2-oxoglutarate/malate carrier, the phosphate carrier protein; the pyruvate 
carrier, the dicarboxylate carrier which transports malate, succinate, fumarate, and phosphate; the 

10 tricarboxylate carrier which transports citrate and malate; and the Grave's disease carrier protein, a 
protein recognized by IgG in patients with active Grave's disease, an autoimmune disorder resulting 
in hyperthyroidism. Proteins in this family consist of three tandem repeats of an approximately 100 
amino acid domain, each of which contains two transmembrane regions (Stryer, L. (1995) 
Biochemistry. W.H. Freeman and Company, New York NY, p. 551; PROSITE PDOC00189 

15 Mitochoncirial energy tr^ 
*275000 Graves Disease). 

This class of transporters also includes the mitochondrial uncoupling proteins, which create 
proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation 
from ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling 

20 proteins have been implicated as modulators of thermoregulation and metabolic rate, and have been 
proposed as potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al. 
(1999) J. Int. Med. 245:637-642). 
" Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
25 ions across the plasma membrane. The movement of ions requires ion channels, which form ion- 
selective pores within the membrane. There are two basic types of ion channels, ion transporters and 
gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 
transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of an 
ion down the ion's electrochemical gradient under restricted conditions. Together, these types of ion 
30 channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical 
impulse conduction down the axon of a nerve cell, 2) transport of molecules into cells against 
concentration gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 
Tnti Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the 
35 energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient. 
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These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 
transporters, including Na + -K + ATPase, Ca 2+ -ATPase, and H + -ATPase, are activated by a 
phosphorylation event P-class ion transporters are responsible for maintaining resting potential 
distributions such that cytosolic concentrations of Na + and Ca» are low and cytosolic concentration 

5 of K + is high. The vacuolar (V) class of ion transporters includes H + pumps on intracellular 

organelles, such as lysosomes and Golgi. V-class ion transporters are responsible for generating the 
low pH within the lumen of these organelles that is required for function. The coupling factor (F) 
class consists of H + pumps in the mitochondria. F^lass ion transporters utilize a proton gradient to 
generate ATP from ADP and inorganic phosphate (P,). 

10 The P-ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and 

several large cytoplasmic regions that may play a role in ion binding (Scarborough, G.A. (1999) Curr. 
Opin. Cell Biol. 11:517-522). The V-ATPases are composed of two functional domains: the V, 
domain, a peripheral complex responsible for ATP hydrolysis; and the V 0 domain, an integral 
complex responsible for proton translocation across the membrane. The F-ATPases are structuraUy 

15 and evolutionarily related to the V-ATPases. The F-ATPaseF 0 domain contains 12 copies of thee 
subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a 
single buried carboxyl group in TM2 that is essential for proton transport. The V-ATPase V 0 domain 
contains three types of homologous c subunits with four or five transmembrane domains and the 
essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a subunit that 

20 may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J. Biol. Chem. 
274:12951-12954). 

The resting potential of the cell is utilized in many processes involving carrier proteins and 
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out of 
the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport 
25 (symport) so that the movement of Na + down an electrochemical gradient drives transport of the other 
molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca 2+ out of the cell 
with transport of Na + into the cell (antiport). 
fiated Ion Channels 

Gated ion channels control ion flow by regulating the opening and closing of pores. The 
30 ability to control ion flux through various gating mechanisms allows ion channels to mediate such 
diverse signaling and homeostatic functions as neuronal and endocrine signaling, muscle contraction, 
fertilization, and regulation of ion and pH balance. Gated ion channels are categorized according to 
the manner of regulating the gating function. Mechanically-gated channels open their pores in 
response to mechanical stress; voltage-gated channels (e.g., Na + , K + , Ca 2+ , and CI" channels) open 
35 their pores in response to changes in membrane potential; and ligand-gated channels (e.g., 
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acetylcholine-, serotonin-, and glutamate-gated cation channels, and GABA- and glycine-gated 
chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter. 
The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and 
closing) are sometimes modulated by association with auxiliary channel proteins and/or post 

5 translational modifications, such as phosphorylation. 

Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of 
touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle 
contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently cloned 
from rat kidney. The SIC channel belongs to a group of channels which are activated by pressure or 

10 stress on the cell membrane and conduct both Ca 2t and Na + (Suzuki, M. et al. (1999) J . Biol. Chem. 
274:6330-6335). 

The pore-forming subunits of the voltage-gated cation channels form a superfamily of ion 
channel proteins. The characteristic domain of these channel proteins comprises six transmembrane 
domains (S1-S6), a pore-forming region (?) located between S5 and S6, and intracellular amino and 
15 carboxy termini. In the Na* and Ca 2+ subfamilies, this domain is repeated four times, while in the K + 
channel subfamily, each channel is formed from a tetramer of either identical or dissimilar subunits. 
The P region contains information specifying the ion selectivity for the channel. In the case of K + 
channels, a GYG tripeptide is involved in this selectivity (Ishii, T.M. et al. (1997) Proc. Natl. Acad. 

Sci. USA 94:11651-11656). 

20 Voltage-gated Na + and K + channels are necessary for the function of electrically excitable 

cells, such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and 
muscle contraction, arise from large, transient changes in the permeability of the membrane to Na + 
and K + ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na* 
channels. Sodium ions flow into the cell, further depolarizing the membrane and opening more 

25 voltage-gated Na + channels, which propagates the depolarization down the length of the cell. 
Depolarization also opens voltage-gated potassium channels. Consequently, potassium ions flow 
outward, which leads to repolarization of the membrane. Voltage-gated channels utilize charged 
residues in the fourth transmembrane segment (S4) to sense voltage change. The open state lasts only 
about 1 millisecond, at which time the channel spontaneously converts into an inactive state that 

30 cannot be opened irrespective of the membrane potential. Inactivation is mediated by the channel's 
N-terminus, which acts as a plug that closes the pore. The transition from an inactive to a closed state 
requires a return to resting potential. 

Voltage-gated Na + channels are heterotrimeric complexes composed of a 260 kDa pore- 
forming a subunit that associates with two smaller auxiliary subunits, pi and P2. The P2 subunit is a 

35 integral membrane glycoprotein that contains an extracellular Ig domain, and its association with a 
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and pi subunits correlates with increased functional expression of the channel, a change in its gating 
properties, as well as an increase in whole cell capacitance due to an increase in membrane surface 
area (Isom, L.L. et al. (1995) Cell 83:433-442). 

Non voltage-gated Na + channels include the members of the amiloride-sensitive Na + 
5 channeVdegenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two 
transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini 
located within the cell. The NaC/DEG family includes the epithelial Na* channel (ENaC) involved in 
Na + reabsorption in epitbelia including the airway, distal colon, cortical collecting duct of the kidney, 
and exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1 and Liddle's 
10 syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the recently 

characterized iT-gated cation channels or acid-sensing ion channels (ASIC). ASIC subunits are 
expressed in the brain and form heteromultimeric Na + -permeable channels. These channels require 
acid pH fluctuations for activation. ASIC subunits show homology to the degenerins, a family of 
mechanically-gated channels originally isolated from C. elegans. Mutations in the degenerins cause 
15 neurodegeneration. ASIC subunits may also have a role in neuronal function, or in pain perception, 
since tissue acidosis causes pain (Waldmann, R. and M. Lazdunski (1998) Curr. Opin. Neurobiol. 
8:418-424; Eglen, R.M. et al. (1999) Trends Pharmacol. Sci. 20:337-342). 

K + channels are located in all cell types, and may be regulated by voltage, ATP 
concentration, or second messengers such as Ca 2+ and cAMP. In non-excitable tissue, K + channels 
20 are involved in protein synthesis, control of endocrine secretions, and the maintenance of osmotic 
equilibrium across membranes. In neurons and other excitable cells, in addition to regulating action 
potentials and repolarizing membranes, K + channels are responsible for setting resting membrane 
potential. The cytosol contains non-diffusible anions and, to balance this net negative charge, the cell 
contains a Na + -K + pump and ion channels that provide the redistribution of Na + , K + , and CI". The 
25 pump actively transports Na + out of the cell and K + into the cell in a 3:2 ratio. Ion channels in the 
plasma membrane allow K + and CI " to flow by passive diffusion. Because of the high negative charge 
within the cytosol, CI" flows out of the cell. The flow of K + is balanced by an electromotive force 
pulling K + into the cell, and a K + concentration gradient pushing K + out of the cell. Thus, the resting 
membrane potential is primarily regulated by K + flow (Salkoff, L. and T. Jegla (1995) Neuron 15:489- 
30 492). 

Potassium channel subunits of the Shaker -like superfamily all have the characteristic six 
transmembrane/1 pore domain structure. Four subunits combine as homo- or heterotetramers to form 
functional K channels. These pore-forming subunits also associate with various cytoplasmic p 
subunits that alter channel inactivation kinetics. The Shaker -like channel family includes the voltage- 
35 gated K + channels as well as the delayed rectifier type channels such as the human ether-a-go-go 
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related gene (HERG) associated with long QT, a cardiac dysrythmia syndrome (Curran, M.E. (1998) 
Curr. Opin. Biotechnol. 9:565-572; Kaczorowski, G.J. and ML. Garcia (1999) Curr. Opin. Chen, 
Biol. 3:448-458). 

A second superfamily of K + channels is composed of the inward rectifying channels (Kir). 
Kir channels have the property of preferentially conducting K* currents in the inward direction. 
These proteins consist of a single potassium selective pore domain and two transmembrane domains , 
which correspond to the fifth and sixth transmembrane domains of voltage-gated K + channels. Kir 
subunits also associate as tetramers. The Kir family includes ROMK1, mutations in which lead to 
Bartter syndrome, a renal tubular disorder. Kir channels are also involved in regulation of cardiac 
pacemaker activity, seizures and epilepsy, and insulin regulation (Doupnik, C.A. et al. (1995) Curr. 
Opin. Neurobiol. 5:268-277; Curran, supra). 

The recently recognized TWK K + channel family includes the mammalian TWTK-1, TREK- 
1 and TASK proteins. Members of this family possess an overall structure with four transmembrane 
domains and two P domains. These proteins are probably involved in controlling the resting potential 
15 in a large set of cell types (Duprat, F. et al. (1997) EMBO J 16:5464-5471). 

The voltage-gated Ca 2+ channels have been classified into several subtypes based upon their 
electrophysiological and pharmacological characteristics. L-type Ca* channels are predominantly 
expressed in heart and skeletal muscle where they play an essential role in excitation-contraction 
coupling. T-type channels are important for cardiac pacemaker activity, while N-type and P/Q-type 
20 channels are involved in the control of neurotransmitter release in the central and peripheral nervous 
system The L-type and N-type voltage-gated Ca* channels have been purified and, though their 
functions differ dramatically, they have similar subunit compositions. The channels are composed of 
three subunits. The a, subunit forms the membrane pore and voltage sensor, while the 0,5 and p 
subunits modulate the voltage-dependence, gating properties, and the current amplitude of the 
25 channel. These subunits are encoded by at least six a„ one 0,6, and four P genes. A fourth subunit, y, 
has been identified in skeletal muscle (Walker, D. et al. (1998) J. Biol. Chem. 273:2361-2367; 
McCleskey, E.W. (1994) Curr. Opin. Neurobiol. 4:304-312). 

The transient receptor family (Trp) of calcium ion channels are thought to mediate 
capacitative calcium entry (CCE). CCE is the Ca 2+ influx into cells to resupply Ca* stores depleted 
30 by the action of inositol triphosphate (JP3) and other agents in response to numerous hormones and 
growth factors. Trp and Trp-like were first cloned from Drosophila and have similarity to voltage 
gated Ca2 + channels in the S3 through S6 regions. This suggests that Trp and/or related proteins may 
formrruurnnalian CCC entry channels (Zhu, X. et al. (1996) Cell 85:661-671; Boulay, G. et al. (1997) 
J. Biol. Chem 272:29672-29680). Melastatin is a gene isolated in both the mouse and human, and 
35 whose expression in melanoma cells is inversely correlated with melanoma aggressiveness in vivo. 
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The human cDNA transcript corresponds to a 1533-amino acid protein having homology to members 
of the Trp family. It has been proposed that the combined use of malastatin mRNA expression status 
and tumor thickness might allow for the determination of subgroups of patients at both low and high 
risk for developing metastatic disease (Duncan, L.M. et al (2001) J. Clin. Oncol. 19:568-576). 
5 Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and 

organelle pH. In secretory epithelial cells, CI ■ enters the cell across a basolateral membrane through 
an Na\ KVCT cotransporter, accumulating in the cell above its electrochemical equilibrium 
concentration. Secretion of CI " from the apical surface, in response to hormonal stimulation, leads to 
flow of Na + and water into the secretory lumen. The cystic fibrosis transmembrane conductance 
10 regulator (CFTR) is a chloride channel encoded by the gene for cystic fibrosis, a common fatal 
genetic disorder in humans. CFTR is a member of the ABC transporter family, and is composed of 
two domains each consisting of six transmembrane domains followed by a nucleotide-binding site. 
Loss of CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus 
that coat the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. The 
15 resulting blockage of these sites leads to pancreatic insufficiency, "meconium ileus", and devastating 
"chronic obstructive pulmonary disease" (Al-Awqati, Q. et al. (1992) J. Exp. Biol. 172:245-266). 

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane 
domains, as well as two small globular domains known as CBS domains. The CLC subunits 
probably function as homotetramers. CLC proteins are involved in regulation of cell volume, 
20 membrane potential stabilization, signal transduction, and transepithelial transport. Mutations in 
CLC-1, expressed predominantly in skeletal muscle, are responsible for autosomal recessive 
generalized myotonia and autosomal dominant myotonia congenita, while mutations in the kidney 
channel CLC-5 lead to kidney stones (Jentsch, TJ. (1996) Curr. Opin. Neurobiol. 6:303-310). 

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds 
25 to the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter 
binds to their extracellular domain. These channels exist in the postsynaptic membrane of nerve or 
muscle cells. Them are two types of neurotransmitter-gated channels. Sodium channels open in 
response to excitatory neurotransmitters, such as acetylcholine, glutamate, and serotonin. This 
opening causes an influx of Na* and produces the initial localized depolarization that activates the 
30 voltage-gated channels and starts the action potential. Chloride channels open in response to 
inhibitory neurotransmitters, such as y-aminobutyric acid (GABA) and glycine, leading to 
hyperpolarization of the membrane and the subsequent generation of an action potential. 
Neurotransmitter-gated ion channels have four transmembrane domains and probably function as 
pentamers (Jentsch, supra). Amino acids in the second transmembrane domain appear to be important 
35 in determining channel permeation and selectivity (Sather, W.A. et al. (1994) Curr. Opin. Neurobiol. 
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4:313-323). 

Ligand-gated channels can be regulated by intracellular second messengers. For example, 
calcium-activated K + channels are gated by internal calcium ions. In nerve cells, an influx of calcium 
during depolarization opens K + channels to modulate the magnitude of the action potential (Ishi et al., 
supra) . The large conductance (BK) channel has been purified from brain and its subunit 
composition determined. The a subunit of the BK channel has seven rather than six transmembrane 
domains in contrast to voltage-gated K + channels. The extra transmembrane domain is located at the 
subunit N-terminus. A 28-amino-acid stretch in the C-terminal region of the subunit (the "calcium 
bowl" region) contains many negatively charged residues and is thought to be the region responsible 
for calcium binding. The P subunit consists of two transmembrane domains connected by a 
glycosylated extracellular loop, with intracellular N- and C-termini (Kaczorowski, supra; Vergara, C. 
et al. (1998) Curr. Opin. Neurobiol. 8:321-329). 

Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best 
examples of these are the cAMP-gated Na + channels involved in olfaction and the cGMP-gated cation 
channels involved in vision. Both systems involve ligand-mediated activation of a G-protein coupled 
receptor which then alters the level of cyclic nucleotide within the cell. CNG channels also represent 
a major pathway for Ca 2+ entry into neurons, and play roles in neuronal development and plasticity. 
CNG channels are tetramers containing at least two types of subunits, an a subunit which can form 
functional homomeric channels, and a p subunit, which modulates the channel properties. All CNG 
subunits have six transmembrane domains and a pore forming region between the fifth and sixth 
transmembrane domains, similar to voltage-gated K + channels. A large C-terminal domain contains a 
cyclic nucleotide binding domain, while the N-terminal domain confers variation among channel 
subtypes (Zufall, F. et al. (1997) Curr. Opin. Neurobiol. 7:404-412). 

The activity of other types of ion channel proteins may also be modulated by a variety of 
intracellular signalling proteins. Many channels have sites for phosphorylation by one or more 
protein kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II, all 
of which regulate ion channel activity in cells. Kir channels are activated by the binding of the Gfr 
subunits of heterotrimeric G-proteins (Reimann, F. and F.M. Ashcroft (1999) Curr. Opin. Cell. Biol. 
1 1:503-508). Other proteins are involved in the localization of ion channels to specific sites in the 
cell membrane. Such proteins include the PDZ domain proteins known as MAGUKs (membrane- 
associated guanylate kinases) which regulate the clustering of ion channels at neuronal synapses 
(Craven, S.E. and D.S. Bredt (1998) Cell 93:495-498). 
Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
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transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters 
and ion channels are associated with several disorders, e.g., cystic fibrosis, glucose-galactose 
malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes 
mellitus. Single-gene defect diseases resulting in an inability to transport small molecules across 
5 membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease (van* Hoff, 
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med. 120:218-226; 
and Chillon, M. et al. (1995) New Engl. J. Med. 332: 1475-1480). 

Human diseases caused by mutations in ion channel genes include disorders of skeletal 
muscle, cardiac muscle, and the central nervous system Mutations in the pore-forming subunits of 
10 sodium and chloride channels cause myotonia, a muscle disorder in which relaxation after voluntary 
contraction is delayed. Sodium channel myotonias have been treated with channel blockers. 
Mutations in muscle sodium and calcium channels cause forms of periodic paralysis, while mutations 
in the sarcoplasmic calcium release channel, T-tubule calcium channel, and muscle sodium channel 
cause malignant hyperthermia. Cardiac arrythmia disorders such as the long QT syndromes and 
15 idiopathic ventricular fibrillation are caused by mutations in potassium and sodium channels (Cooper, 
E.C. and L.Y. Jan (1998) Proc. Natl. Acad. Sci. USA 96:4759-4766). All four known human 
idiopathic epilepsy genes code for ion channel proteins (Berkovic, S J. and I.E. Scheffer (1999) Curr. 
Opin. Neurology 12:177-182). Other neurological disorders such as ataxias, hemiplegic migraine and 
hereditary deafness can also result from mutations in ion channel genes (Jen, J. (1999) Curr. Opin. 
20 Neurobiol. 9:274-280; Cooper, supra) . 

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia. 
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head trauma, 
and neurodegenerative disease (Taylor, CP. and L.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98). 
25 Various classes of ion channels also play an important role in the perception of pain, and thus are 
potential targets for new analgesics. These include the vanilloid-gated ion channels, which are 
activated by the vanilloid capsaicin, as well as by noxious heat. Local anesthetics such as lidocaine 
and mexiletine which blockade voltage-gated Na + channels have been useful in the treatment of 
neuropathic pain (Eglen, supra) . 
30 Ion channels in the immune system have recently been suggested as targets for 

immunomoduladon. T-cell activation depends upon calcium signaling, and a diverse set of T-cell 
specific ion channels has been characterized that affect this signaling process. Channel blocking 
agents can inhibit secretion of lymphokines, cell proliferation, and killing of target cells. A peptide 
antagonist of the T-cell potassium channel Kvl.3 was found to suppress delayed-type hypersensitivity 
35 and allogenic responses in pigs, validating the idea of channel blockers as safe and efficacious 
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immunosuppressants (Cahalan, M.D. and K.G. Chandy (1997) Curr. Opin. Biotechnol. 8:749-756). 

The discovery of new transporters and ion channels, and the polynucleotides encoding them, 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of transport, neurological, muscle, immunological, and cell proliferative 
5 disorders, and in the assessment of the effects of exogenous compounds on the expression of nucleic 
acid and amino acid sequences of transporters and ion channels. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, transporters and ion channels, referred to 

10 collectively as "TRICH" and individually as "TRICH-1," "TRICH-2," "TRICH-3," 'TRICH-4," 
'TRICH-5," "TRICH-6," 'TRICH-7," "TRICH-8," 'TRICH-9," "TRICH-10," "TRICH-11," 
'TRICH-12," "TRICH-13," 'TRICH-14," 'TRICH-15," "TRICH-16," 'TRICH-17," "TRICH-18," 
"TRICH-19," "TRICH-20," "TRICH-21," "TRICH-22," "TRICH-23," "TRICH-24," «TRICH-25," 
'TRICH-26," "TRICH-27," "TRICH-28," 'TRICH-29," and "TRICH-30." In one aspect, the 

15 invention provides an isolated polypeptide selected from the group consisting of a) a polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, b) a 
polypeptide comprising a naturally occurring amino acid sequence at least 90% identical to an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-30, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 

20 ID NO: 1-30, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-30. In one alternative, the invention provides an 
isolated polypeptide comprising the amino acid sequence of SEQ ID NO:1-30. 

The invention further provides an isolated polynucleotide encoding a polypeptide selected 
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the 

25 group consisting of SEQ ID NO: 1-30, b) a polypeptide comprising a naturally occurring amino acid 
sequence at least 90% identical to an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-30, c) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-30, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30. 

30 In one alternative, the polynucleotide encodes a polypeptide selected from the group consisting of 
SEQ ED NO: 1-30. In another alternative, the polynucleotide is selected from the group consisting of 
SEQ ID NO:31-60. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
35 consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
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of SEQ ID NO.1-30, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:1-30, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-30. In one alternative, the 
invention provides a ceU transformed with the recombinant polynucleotide. In another alternative, the 
invention provides a transgenic organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO:1-30, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-30, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-30. The method comprises a) 
culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is 
transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-30, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO.1-30, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 

consisting of SEQ ID NO: 1-30. 

The invention further provides an isolated polynucleotide selected from the group consisting 
of a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:31-60, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at 
least 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:31-60, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). In one alternative, the 
polynucleotide comprises at least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 
sample, said target polynucleotide having a sequence of a polynucleotide selected from the group 
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
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consisting of SEQ ID NO.31-60, b) a polynucleotide comprising a naturally occurring polynucleotide 
sequence at least 90% identical to a polynucleotide sequence selected from the group consisting of 
SEQ ID NO.31-60, c) a polynucleotide complementary to the polynucleotide of a), d) a 
polynucleotide complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The 
method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous 
nucleotides comprising a sequence complementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, under conditions whereby a 
hybridization complex is formed between said probe and said target polynucleotide or fragments 
thereof, and b) detecting the presence or absence of said hybridization complex, and optionally, if 
present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting 
of a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of 
SEQ ID NO.31-60, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at 
least 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:31-60, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction amplification, and b) detecting the presence or absence of said amplified target 
polynucleotide or fragment thereof, and, optionally, if present, the amount thereof. 

The invention further provides a composition comprising an effective amount of a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-30, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-30, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-30, and a pharmaceutically acceptable excipient. In one embodiment, the 
composition comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 1- 
30. The invention additionally provides a method of treating a disease or condition associated with 
decreased expression of functional TRICH, comprising administering to a patient in need of such 
treatment the composition. 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 



14 



PCTVUS01/24217 

WO 02/12340 

acid sequence selected from the group consisting of SEQ ID NO:1-30, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO:1-30, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -30, and d) an 
5 immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-30. The method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting agonist activity in the sample. In one alternative, the 
invention provides a composition comprising an agonist compound identified by the method and a 
pharmaceutical^ acceptable excipient. In another alternative, the invention provides a method of 
10 treating a disease or condition associated with decreased expression of functional TRICH, comprising 
administering to a patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-30, b) a polypeptide 
15 comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid 

sequence selected from the group consisting of SEQ ID NO: 1-30, c) a biologically active fragment of 
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, 
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO:1-30. The method comprises a) exposing a sample comprising the 
20 polypeptide to a compound, and b) detecting antagonist activity in the sample. In one alternative, the 
invention provides a composition comprising an antagonist compound identified by the method and a 
pharmaceutical^ acceptable excipient. In another alternative, the invention provides a method of 
treating a disease or condition associated with overexpression of functional TRICH, comprising 
administering to a patient in need of such treatment the composition. 
25 The invention further provides a method of screening for a compound that specifically binds 

to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO.1-30, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO.1-30, c) a biologically active fragment of a polypeptide 
30 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-30. The method comprises a) combining the polypeptide with at least 
one test compound under suitable conditions, and b) detecting binding of the polypeptide to the test 
compound, thereby identifying a compound that specifically binds to the polypeptide. 
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The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-30, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected 
5 from the group consisting of SEQ ID NO: 1-30, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:1-30. The method comprises a) combining the polypeptide with at least 
one test compound under conditions permissive for the activity of the polypeptide, b) assessing the 
10 activity of the polypeptide in the presence of the test compound, and c) comparing the activity of the 
polypeptide in the presence of the test compound with the activity of the polypeptide in the absence 
of the test compound, wherein a change in the activity of the polypeptide in the presence of the test 
compound is indicative of a compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
15 altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:31-60, the method 
comprising a) exposing a sample comprising the target polynucleotide to a compound, and b) 
detecting altered expression of the target polynucleotide. 

The invention further provides a method for assessing toxicity of a test compound, said 
20 method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:31-60, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a 
25 polynucleotide sequence selected from the group consisting of SEQ ID NO:3 1-60, iii) a 

polynucleotide having a sequence complementary to i), iv) a polynucleotide complementary to the 
polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions 
whereby a specific hybridization complex is formed between said probe and a target polynucleotide 
in the biological sample, said target polynucleotide selected from the group consisting of i) a 
30 polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:31-60, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:3 1-60, 
iii) a polynucleotide complementary to the polynucleotide of i), iv) a polynucleotide complementary 
to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Alternatively, the target • 
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polynucleotide comprises a fragment of a polynucleotide sequence selected from the group consisting 
of i)-v) above; c) quantifying the amount of hybridization complex; and d) comparing the amount of 
hybridization complex in the treated biological sample with the amount of hybridization complex in 
an untreated biological sample, wherein a difference in the amount of hybridization complex in the 
5 treated biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 

sequences of the present invention. 
10 Table 2 shows the GenBank identification number and annotation of the nearest GenBank 

homolog for polypeptides of the invention. The probability score for the match between each 

polypeptide and its GenBank homolog is also shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including 

predicted motifs and domains, along with the methods, algorithms, and searchable databases used for 
15 analysis of the polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 

polynucleotide sequences of the invention, along with selected fragments of the polynucleotide 

sequences. 

Table 5 shows the representative cDNA library for polynucleotides of the invention. 
20 Table 6 provides an appendix which describes the tissues and vectors used for construction of 

the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold 
parameters. 

25 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
30 particular embodiments only, and is not intended to limit the scope of the present invention which 
will be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an." 
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
35 reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
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forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 

5 used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing 
the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 

10 DEFINITIONS 

"TRICH" refers to the amino acid sequences of substantially purified TRICH obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
15 TRICH. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of TRICH either by directly interacting with 
TRICH or by acting on components of the biological pathway in which TRICH participates. 

An "allelic variant" is an alternative form of the gene encoding TRICH. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
20 polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

25 "Altered" nucleic acid sequences encoding TRICH include those sequences with deletions, 

insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or a 
polypeptide with at least one functional characteristic of TRICH. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 
of the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic variants, 

30 with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding 
TRICH. The encoded protein may also be "altered," and may contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent TRICH Deliberate amino acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 

35 residues, as long as the biological or immunological activity of TRICH is retained. For example, 



18 



WO 02/12340 " PCT/US01/24217 

negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hyorophilicity values may include: asparagine and glutamine; and serine and threonine. 
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of TRICH Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of TRICH either by 
directly interacting with TRICH or by acting on components of the biological pathway in which 
TRICH participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic determinant 
Antibodies that bind TRICH polypeptides can be prepared using intact polypeptides or using 
fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 
oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translation of RNA or synthesized chemically, and can be conjugated to a carrier protein if desired. 
Commonly used carriers that are chemically coupled to peptides include bovine serum albiimin, 
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; 
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RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oUgonucleotides having modified 
sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-raethyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functionsofanaturaUyoccurringmolecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic TRICH or of any oligopeptide 
thereof, to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

15 "Complementary" describes the relationship between two single-stranded nucleic acid 

sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement, 
3'-TCA-5\ 

A "composition comprising a given polynucleotide sequence" and a "composition comprising 
a given amino acid sequence" refer broadly to any composition containing the given polynucleotide 
or amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding TRICH or fragments of TRICH may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl 
25 sulfate; SDS), and other components (e.g., Denhardfs solution, dry milk, salmon spennDNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been 
assembled from one or more overlapping cDN A, EST, or genomic DNA fragments using a computer 
30 program for fragment assembly, such as the GELVffiW fragment assembly system (GCG, Madison 
WD or Phrap (University of Washington, Seattle WA). Some sequences have been both extended and 
assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of 
35 the protein is conserved and not significantly changed by such substitutions. The table below shows 
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amino acids which may be substituted for an original amino acid in a protein and which are regarded 





as conservative amino acid substitutions. 

Original Residue 


PnrKprvative Substitution 




A 1«, 

Ala 


\J1Y, owl 


5 


Arg 






Asn 






Asp 


/\sn, VJIU 




Cys 


Ala Car 

/via, oer 




Gin 


A Cn frill TTlQ 


10 


Glu 


A pr\ /~11n "Hie 

/vsp, vjin, xus 




Gly 


Alo 

Ala 




His 


/vsn, Arg, vjin, yiu 




lie 


Leu, Val 




Leu 


lie, vai 


15 


Lys 


ATg, VJin, LrlU 




Met 


T ah He 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 


20 


Tip 


Phe, Tyr 




Tyr 


His, Phe, Trp 




Val 


lie, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
25 backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 
30 The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 

Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
retains at least one biological or immunological function of the natural molecule. A derivative 
polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least 
35 one biological or immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

"Differential expression" refers to increased or upregulated; or decreased, downregulated, or 
absent gene or protein expression, determined by comparing at least two different samples. Such 
40 comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 
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"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 
exon may represent a structural or functional domain of the encoded protein, new proteins may be 

• assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

5 A "fragment" is a unique portion of TRICH or the polynucleotide encoding TRICH which is 

identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 

10 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 

15 specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID NO:31-60 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO:31-60, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:31-60 is useful, for 
20 example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:31-60 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO.-31-60 and the region of SEQ ID NO:31-60 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO:1-30 is encoded by a fragment of SEQ ID NO:31-60. A fragment 
25 of SEQ ID NO: 1-30 comprises a region of unique amino acid sequence that specifically identifies 
SEQ ID NO: 1-30. For example, a fragment of SEQ ID NO: 1-30 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO:1-30. The precise length of 
a fragment of SEQ ID NO: 1-30 and the region of SEQ ID NO:1-30 to which the fragment 
corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
30 purpose for the fragment. 

A "full length" polynucleotide sequence is one containing at least a translation initiation 
codon (e.g., methionine) followed by an open reading frame and a translation termination codon. A 
"full length" polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between 
35 two or more polynucleotide sequences or two or more polypeptide sequences. 
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The tenns "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program. This program is part of the LASERGENE software package, a suite of 
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 
8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as 
follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue 
weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent 
similarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403^10), which is available 
from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nhn.nih.gov/BLAST/. The BLAST software suite includes various sequence 
analysis programs including "blastn," that is used to align a known polynucleotide sequence with 
other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at htto://ww.ncbi.iilmnm.gov/gorf^l2.htinl. 
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0. 12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 
Reward for match: 1 
Penalty for mismatch: -2 
Open Gap: 5 and Extension Gap: 2 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 11 
Filter: on 
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Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the charge and .hydrophobic^ at the 
site of substitution,- thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0. 12 (April-21-2000) with blastp set at default parameters. Such default parameters may be, for 

example: 

Matrix: BLOSUM62 

Open Gap: Hand Extension Gap: 1 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
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for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
5 length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 
10 The term "humanized antibody" refers to an antibody molecule in which the amino acid 

sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
15 hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in detennining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
20 conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill 
in the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68»C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 ng/tal sheared, denatured salmon sperm DNA. 
25 Generally, stringency of hybridization is expressed, in part, with reference to the temperature 

under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (TJ for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and P H) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and 
30 conditions for nucleic acid hybridization are. well known and can be found in Sambrook, J. et al. 
(1989) ifetenW Honing: A I -ahnratorv Manual . 2 nd ed., vol. 1-3, Cold Spring Harbor Press, 
Plainview NY; specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, 
35 forlhour. Alternatively, temperatures of about 65°C, 60»C, 55°C, or 42°C may be used. SSC 
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concentration may be varied from about 0.1 to 2 x SSC with SDS being present at about 0.1%. 
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 figtad. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under particular 
5 circumstances.suchasforRNA-.DNAhybridizations. Useful variations on these wash conditions 
will be readily apparent to those of ordinary skill in the art. Hybridization, particularly under high 
stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such 
similarity is strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 
The term "hybridization complex" refers to a complex formed between two nucleic acid 
10 sequences by virtue of the formation of hydrogen bonds between complementary bases. A 

hybridization complex may be formed in solution (e.g., C* or B* analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 
15 The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 

sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
20 cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
of TRICH which is useful in any of the antibody production methods disclosed herein or known in the 
25 art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 
30 The term "modulate" refers to a change in the activity of TRICH For example, modulation 

may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of TRICH. 

J The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
35 synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
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antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

'Teptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

'Tost-translational modification" of an TRICH may involve lipidation, glycosylate, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art These processes may occur synthetically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of TRICH. 

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, ligands, chemUuminescent agents, and enzymes. 
"Primers" are short nucleic acids, usually DNA oUgonucleotides, which may be annealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. et al. (1989) MnW.nlar Cloninr: A Laboratory Manual, 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular 
Biology , Greene Publ. Assoc. & Wiley-btersciences, New York NY; Innis, M. et al. (1990) PC£ 
Protocols. A Guide to M.th.H. anH Applications. Academic Press, San Diego CA. PCR primer pairs 
can be derived from a known sequence, for example, by using computer programs intended for that 
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purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 

5 100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 
selection programs have incorporated additional features for expanded capabilities. For example, the 
PrimOU primer selection program (available to the public from the Genome Center at University of 
Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from 

10 megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 
primer selection program (available to the public from the Whitehead Institute/MIT Center for 
Genome Research, Cambridge MA) allows the user to input a "mispriming library," in which 
sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the 
selection of oligonucleotides for microarrays. (The source code for the latter two primer selection 

15 programs may also be obtained from their respective sources and modified to meet the user' s specific 
needs.) The PrimeGen program (available to the public from the UK Human Genome Mapping 
Project Resource Centre, Cambridge UK) designs primers based on multiple sequence alignments, 
thereby allowing selection of primers that hybridize to either the most conserved or least conserved 
regions of aligned nucleic acid sequences. Hence, this program is useful for identification of both 

20 unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and 

polynucleotide fragments identified by any of the above selection methods are useful in hybridization 
technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to 
identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide selection are not limited to those described above. 

25 A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 

that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 

30 been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
35 vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
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expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5 1 and 3' untranslated regions 
(TJTRs). Regulatory elements interact with host or viral proteins which control transcription, 

5 translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cof actors; inhibitors; magnetic particles; and 

other moieties known in the art 
10 An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 

sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 

instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH, 

15 nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a 

. cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

' The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 

20 synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 

25 The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 

they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 

30 by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

35 A "transcript image" refers to the collective pattern of gene expression by a particular cell 
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type or tissue under given conditions at a given time. 

'Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 

5 sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or 
viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term 
"transformed cells" includes stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well 

10 as transiently transformed cells which express the inserted DNA or RNA for limited periods of time. 
A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 

15 of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants and animals. The isolated DNA of the present invention can be 

20 introduced into the host by methods known in the art, for example infection, transfection, 

transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook et al. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
25 at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater 
30 sequence identity over a certain defined length. A variant may be described as, for example, an 

"allelic" (as defined above), "splice," "species," or "polymorphic" variant A splice variant may have 
significant identity to a reference molecule, but will generally have a greater or lesser number of 
polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
35 reference molecule. Species variants are polynucleotide sequences that vary from one species to 
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another. The resulting polypeptides will generally have significant amino acid identity relative to 
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
5 presence of SNPs may.be indicative of, for example, a certain population, a disease state, or a 

propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
10 1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 90%, at least 91%, at least 92%, at least 93%, at least 
94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

15 THE INVENTION 

The invention is based on the discovery of new human transporters and ion channels 
(TRICH), the polynucleotides encoding TRICH, and the use of these compositions for the diagnosis, 
treatment, or prevention of transport, neurological, muscle, immunological, and cell proliferative 
disorders. 

20 Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 

sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to a 
single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is denoted 
by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an Incyte 
polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide sequence is 

25 denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ ID NO:) and 
an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown. 

Table 2 shows sequences with homology to the polypeptides of the invention as identified by 
BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 
polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding Incyte 

30 polypeptidesequencenumber(mcytePolypeptidero)forpolypeptidesofmeinvention. Column3 
shows the GenBank identification number (Genbank ID NO:) of the nearest GenBank homolog. 
Column 4 shows the probability score for the match between each polypeptide and its GenBank 
homolog. Column 5 shows the annotation of the GenBank homolog along with relevant citations 
where applicable, all of which are expressly incorporated by reference herein. 

35 Table 3 shows various structural features of the polypeptides of the invention. Columns 1 
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and 2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. 
Column 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosylate sites, as determined by the 
5 MOTIFS program of the GCG sequence analysis software package (Genetics Computer Group, 
Madison WI). Column 6 shows amino acid residues comprising signature sequences, domains, and 
motifs. Column 7 shows analytical methods fox protein structure/function analysis and in some cases, 
searchable databases to which the analytical methods were applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
10 properties establish that the claimed polypeptides are transporters and ion channels. For example, 
SEQ ID NO:6 is 89% identical to rat neuronal nicotinic acetylcholine receptor subunit (GenBank ID 
g6746563) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The 
BLAST probability score is 1.7e-188, which indicates the probability of obtaining the observed 
polypeptide sequence alignment by chance. SEQ ID NO:6 also contains a neurotransmitter-gated ion 
15 channel domain as determined by searching for statistically significant matches in the hidden Markov 
model (HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from 
BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ 
ID NO:6 is a neurotransmitter-gated ion channel. In an alternative example, SEQ ID NO: 14 is 93% 
identical to rat TAP-like ABC transporter (GenBank ID g6045150) as determined by the Basic Local 
20 Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 0.0, which 

indicates the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ 
ID NO: 14 also contains an ABC transporter domain and an ABC transporter transmembrane region as 
determined by searching for statistically significant matches in the hidden Markov model (HMM)- 
based PFAM database of conserved protein family domains. (See Table 3.) Data from BLIMPS, 
25 MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO: 14 is 
an ABC transporter. Li an alternative example, SEQ ID NO:16 is 98% identical to human voltage- 
dependent anion channel (GenBank ID g340199) as determined by the Basic Local Alignment Search 
Tool (BLAST). (See Table 2.) The BLAST probability score is 1.2e-130, which indicates the 
probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO: 16 
30 also contains a eukaryotic porin active site domain as determined by searching for statistically 

significant matches in the hidden Markov model (HMM)-based PFAM database of conserved protein 
family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN analyses provide 
further corroborative evidence that SEQ ID NO: 16 is a mitochondrial porin. In an alternative 
example, SEQ ID NO:20 is 28% identical to a rat voltage-gated calcium channel (GenBank ID 
35 g4586963) as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The 
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BLAST probability score is 2.4e-27, which indicates the probability of obtaining the observed 
polypeptide sequence alignment by chance. Data from BLIMPS andBLAST analyses provide further 
corroborative evidence that SEQ ID NO:20 is a voltage-gated calcium channel. In an alternative 
example, SEQ ID NO:22 is 82% identical to human inhibitory glycine receptor (GenBank ID g31849) 
5 as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 
probability score is l.le-175, which indicates the probability of obtaining the observed polypeptide 
sequence alignment by chance. SEQ ID NO:22 also contains a neurotransmitter-gated ion channel 
domain as determined by searching for statistically significant matches in the hidden Markov model 
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from 
10 BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ 
ID NO:22 is a neurotransmitter-gated ion channel. In an alternative example, SEQ ID NO:30 is 36% 
identical to human ATP binding cassette (ABC) -C transporter (GenBankID gl514530) as 
determined by the Basic Local Alignment Search Tool (BLAST, see Table 2). The BLAST 
probability score is 2.3e-127, which indicates the probability of obtaining the observed polypeptide 
15 sequence alignment by chance. SEQ ID NO:30 also contains ABC transporter domains as determined 
by searching for statistically significant matches in the hidden Markov model (HMM)-based PFAM 
database of conserved protein family domains (see Table 3). Data from BLIMPS, MOTIFS, and 
PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO:30 is an ABC 
transporter. SEQ ID NO: 1-5, SEQ ID NO:7-13, SEQ ID NO: 15, SEQ ID NO:17-19, SEQ ID NO:21, 
20 and SEQ ID NO:23-29 were analyzed and annotated in a similar manner. The algorithms and 
parameters for the analysis of SEQ ID NO.1-30 are described in Table 7. 

As shown in Table 4, the full length polynucleotide sequences of the present invention were 
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 
combinationofthesetwotypesofsequences. Columns 1 and 2 list the polynucleotide sequence 
25 identification number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide 
consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the invention. 
Column 3 shows the length of each polynucleotide sequence in basepairs. Column 4 lists fragments 
of the polynucleotide sequences which are useful, for example, in hybridization or amplification 
technologies that identify SEQ ID NO:31-60 or that distinguish between SEQ ID NO:31-60 and 
30 related polynucleotide sequences. Column 5 shows identification numbers corresponding to cDNA 
sequences, coding sequences (exons) predicted from genomic DNA, and/or sequence assemblages 
comprised of both cDNA and genomic DNA. These sequences were used to assemble the full length 
polynucleotide sequences of the invention. Columns 6 and 7 of Table 4 show the nucleotide start (5') 
and stop (3') positions of the cDNA and/or genomic sequences in column 5 relative to their respective 
35 full length sequences. 
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The identification numbers in Column 5 of Table 4 may refer specifically, for example, to 
Incyte cDNAs along with their corresponding cDNA libraries. For example, 6340750H1 is the 
identification number of an Incyte cDNA sequence, and BRANDIN01 is the cDNA library from 
which it is derived. Incyte cDNAs for which cDNA libraries are not indicated were derived from 
pooled cDNA libraries (e.g., 71911330V1). Alternatively, the identification numbers in column 5 
may refer to GenBank cDNAs or ESTs (e.g., g5110579) which contributed to the assembly of the full 
length polynucleotide sequences. In addition, the identification numbers in column 5 may identify 
sequences derived from the ENSEMBL (The Sanger Centre, Cambridge, UK) database {ue., those 
sequences including the designation "ENST")- Alternatively, the identification numbers in column 5 
may be derived from the NCBI RefSeq Nucleotide Sequence Records Database (i.e., those sequences 
including the designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i*. those 
sequences including the designation "NP"). Alternatively, the identification numbers in column 5 
may refer to assemblages of both cDNA and Genscan-predicted exons brought together by an "exon 
stitching" algorithm For example, FL_^XXXXXJV, Jt 2 _YYYYY_NsJl< represents a "stitched" 
sequence in which XXXXXX is the identification number of the cluster of sequences to which the 
algorithm was applied, and YYYYY is the number of the prediction generated by the algorithm, and 
N w ._, if present, represent specific exons that may have been manually edited during analysis (See 
Example V). Alternatively, the identification numbers in column 5 may refer to assemblages of 
exons brought together by an "exon-stretching" algorithm For example, 
FLXXXXXX_gAAAAA_gBBBBB_l JV is the identification number of a "stretched" sequence, with 
XXXXXX being the Incyte project identification number, &AAAAA being the GenBank identification 
number of the human genomic sequence to which the "exon-stretching" algorithm was applied, 
gBBBBB being the GenBank identification number or NCBI RefSeq identification number of the 
nearest GenBank protein homolog, and N referring to specific exons (See Example V). In instances 
where a RefSeq sequence was used as a protein homolog for the "exon-stretching" algorithm, a 
RefSeq identifier (denoted by "NM," "NP," or "NT") may be used in place of the GenBank identifier 
(i.e., gBBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 
genomic DNA sequences, or derived from a combination of sequence analysis methods. The 
following Table lists examples of component sequence prefixes and corresponding sequence analysis 
methods associated with the prefixes (see Example TV and Example V). 

I prefix | Type of analysis and/or examples of programs [ 



34 



WO 02/12340 



PCTAJS01/24217 



GNN, GFG, 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) orFGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Example V). 


INCY 


Full length transcript and exon prediction from mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript 



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in 
column 5 was obtained to confirm the final consensus polynucleotide sequence, but the relevant 
Incyte cDNA identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotide 
sequences which were assembled using Incyte cDNA sequences. The representative cDNA library is 
the Incyte cDNA library which is most frequently represented by the Incyte cDNA sequences which 
were used to assemble and confirm the above polynucleotide sequences. The tissues and vectors 
which were used to construct the cDNA libraries shown in Table 5 are described in Table 6. 

The invention also encompasses TRICH variants. A preferred TRICH variant is one which 
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the TRICH amino acid sequence, and which contains at least one functional or 
structural characteristic of TRICH. 

The invention also encompasses polynucleotides which encode TRICH. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:31-60, which encodes TRICH The polynucleotide 
sequences of SEQ ID NO:31-60, as presented in the Sequence Listing, embrace the equivalent RNA 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding TRICH. In 
particular, such a variant polynucleotide sequence will have at least about 70% , or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding TRICH. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID 
NO:31-60 which has at least about 70%, or alternatively at least about 85%, or even at least about 
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting 
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of SEQ ID NO:31-60. Any one of the polynucleotide variants described above can encode an amino 
acid sequence which contains at least one functional or structural characteristic of TRICH. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding TRICH, some bearing nunimal 
5 similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring TRICH, and all such variations are to be considered 

10 as being specifically disclosed. 

Although nucleotide sequences which encode TRICH and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring TRICH under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding TRICH 
or its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally 
15 occurring codons. Codons may be selected to increase the rate at which expression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence encoding TRICH and its derivatives without altering the encoded amino acid sequences 
include the production of RNA transcripts having more desirable properties, such as a greater 
20 half-life, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TRICH and 
TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
25 introduce mutations into a sequence encoding TRICH or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO.31-60 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399^07; Kimmel, A.R. (1987) Methods Enzymol. 
30 152:507-5 11.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
35 Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
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combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE 
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is 
automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), 
PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler 

5 (Applied Biosystems). Sequencing is then carried out using either the ABI 373 or 377 DNA 
sequencing system (Applied Biosystems), the MEGABACE 1000 DNA sequencing system 
(Molecular Dynamics, Sunnyvale CA), or other systems known in the art The resulting sequences 
are analyzed using a variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. 
(1997) Short Protocols in Molecular Biology , John Wiley & Sons, New York NY, unit 7.7; Meyers, 

10 R.A. (1995) Molecular Biology and Biotechnology . Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding TRICH may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known hi the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 

15 DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 

Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 
a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments 

20 adjacent to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, 
M. et al. (1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme 
digestions and ligations may be used to insert an engineered double-stranded sequence into a region 
of unknown sequence before performing PCR. Other methods which may be used to retrieve 
unknown sequences are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 

25 19:3055-3060). Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries 
(Clontech, Palo Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries 
and is useful in finding intron/exon junctions. For all PCR-based methods, primers may be designed 
using commercially available software, such as OLIGO 4.06 primer analysis software (National 
Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in 

30 length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of 
about 68°C to 72°C. 

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
35 library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
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into 5* non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
5 specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
10 which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 
which encode TRICH may be cloned in recombinant DNA molecules that direct expression of 
TRICH or fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent 
degeneracy of the genetic code, other DNA sequences which encode substantially the same or a 
15 functionally equivalent amino acid sequence may be produced and used to express TRICH. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter TRICH-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
20 oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
25 5,837,458; Chang, C.-C. et al. (1999) Nat Biotechnol. 17:793-797; Christians, F.C et al. (1999) Nat 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat Biotechnol. 14:315-319) to alter or 
improve the biological properties of TRICH, such as its biological or enzymatic activity or its ability 
to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
30 subjected to selection or screening procedures that identify those gene variants with the desired 

properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
35 optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
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homologous genes in the same gene family, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
maimer. 

In another embodiment, sequences encoding TRICH may be synthesized, in whole or in part, 
5 usmgchemicalmethodswellknownintheart (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) 
Alternatively, TRICH itself or a fragment thereof may be synthesized using chemical methods. For 
example, peptide synthesis can be performed using various solution-phase or solid-phase techniques. 
(See, e.g., Creighton, T. (1984) Pjotefflg, Stores and Modular Properties, WH Freeman, New 
10 YorkNY, PP .55-60;andRoberge,J.Y.etal.(1995)Science269:202-204.) Automated synthesis 
may be achieved using the ABI 431A peptide synthesizer (Applied Biosystems). Additionally, the 
amino acid sequence of TRICH, or any part thereof, may be altered during direct synthesis and/or 
combined with sequences from other proteins, or any part thereof, to produce a variant polypeptide or 
a polypeptide having a sequence of a naturally occurring polypeptide. 
15 The peptide may be substantially purified by preparative high performance liquid 

chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencing. (See, e.g., Creighton, supra, pp. 28-53.) 

In order to express a biologically active TRICH, the nucleotide sequences encoding TRICH 
20 or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in 
polynucleotide sequences encoding TRICH. Such elements may vary in their strength and specificity. 
25 Specific initiation signals may also be used to achieve more efficient translation of sequences 

encoding TRICH. Such signals include the ATG initiation codon and adjacent sequences, e.g. the 
Kozak sequence. In cases where sequences encoding TRICH and its initiation codon and upstream 
regulatory sequences are inserted into the appropriate expression vector, no additional transcriptional 
or translational control signals may be needed. However, in cases where only coding sequence, or a 
30 fragment thereof, is inserted, exogenous translational control signals including an in-frame ATG 
initiation codon should be provided by the vector. Exogenous translational elements and initiation 
codons may be of various origins, both natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of enhancers appropriate for the particular host cell system used. (See, 
e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.) 
35 Methods which are well known to those skilled in the art may be used to construct expression 
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vectors containing sequences encoding TRICH and appropriate transcriptional and translation^ 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et 
5 al. (1995) Current Protocols in Molecular Biology . John Wiley & Sons, New York NY, ch. 9, 13, and 
16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding TRICH. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 

10 yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, 
or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, supra : Ausubel, supra: Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 

15 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7: 1937-1945; Takamatsu, N. (1987) EMBO 
J. 6:307-311; The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New 
York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and 
Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.) Expression vectors derived from retroviruses, 
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 

20 delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 

Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Nad. Acad. Sci. 
USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. 
(1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) 
The invention is not limited by the host cell employed. 

25 In bacterial systems, a number of cloning and expression vectors may be selected depending 

upon the use intended for polynucleotide sequences encoding TRICH. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding TRICH can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 
plasmid (Life Technologies). Ligation of sequences encoding TRICH into the vector's multiple 

30 cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and SM. Schuster (1989) J. Biol. 
Chem 264:5503-5509.) When large quantities of TRICH are needed, e.g. for the production of 

35 antibodies, vectors which direct high level expression of TRICH may be used. For example, vectors 
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containing the strong, inducible SP6 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of TRICH- A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Sacgharomyces cerevisiae or Pichia pastoris . In addition, such 
5 vectors direct either the secretion or intracellular retention of expressed proteins and enable 
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1995, supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. (1994) 
Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of TRICH. Transcription of sequences 
10 encoding TRICH may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. 
(1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) 
15 These constructs can be introduced into plant cells by direct DNA transformation or 

pathogen-mediated transfection. (See, e.g„ The MrGraw Hill Yearbook of Science and Technology 
(1992) McGraw Hill, New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding TRICH may be ligated into 
20 an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses TRICH in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
25 based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g.> Harrington, JJ. et al. (1997) Nat Genet 
30 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression 
of TRICH in cell lines is preferred. For example, sequences encoding TRICH can be transformed 
into cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
35 Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in 
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enriched media before being switched to selective media. The purpose of the selectable marker is to 
confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 
5 Any number of selection systems may be used to recover transformed ceU lines. These 

include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in * and apf cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, 
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
10 methotrexate; neo confers resistance to the aminoglycosides neomycin and G-U8; and als and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc. Nad. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J.MoLBiol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
15 .Nad. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), B glucuronidase and its substrate B-glucuronide, or luciferase and its substrate 
luciferin may be used. These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system 
(See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 
20 Although the presence/absence of marker gene expression suggests that the gene of interest is 

also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding TRICH is inserted within a marker gene sequence, transformed cells containing 
sequences encoding TRICH can be identified by the absence of marker gene function. Alternatively, 
a marker gene can be placed in tandem with a sequence encoding TRICH under the control of a single 
25 promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding TRICH and that express 
TRICH may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
30 amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of TRICH using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-Unked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
35 fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
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monoclonal antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art. (See, 
e.g., Hampton, R. et al. (1990) ^mlngical Meth "Hs, « Moratory Manual, APS Press, St. Paul MN, 
Sect. IV; Coligan, J.E. et al. (1997) rnrrent Protoco ls in Immunology. Greene Pub. Associates and 
5 Wiley-hterscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols , Humana 
Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding TRICH 
10 include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding TRICH, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 
15 of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, fluorescent, chemUuminescent, or chromogenic 
agents, as well as substrates, cefaclors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TRICH may be cultured under 
20 conditions suitable for the expression and recovery of the protein from cell culture. The protein 

produced by a transformed cell may be secreted or retained intracellular^ depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides which encode TRICH may be designed to contain signal sequences which 
direct secretion of TRICH through a prokaryotic or eukaryotic cell membrane. 
25 In addition, a host cell strain may be chosen for its ability to modulate expression of the 

inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylate, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or 
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
30 Different host cells which have specific cellular machinery and characteristic mechanisms for 

post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
35 sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a 
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fusion protein in any of the aforementioned host systems. For example, a chimeric TRICH protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of TRICH activity. Heterologous protein and 
peptide moieties may also f acilitate purification of fusion proteins using commercially available 
5 affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, 
c-myc, and hemagglutinin (HA). GST, MBP. Trx, CBP, and 6-His enable purification of their 
cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and 
metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity 
10 purification of fusion proteins using commercially available monoclonal and polyclonal antibodies 
that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a 
proteolytic cleavage site located between the TRICH encoding sequence and the heterologous protein 
sequence, so that TRICH may be cleaved away from the heterologous moiety following purification. 
Methods for fusion protein expression and purification are discussed in Ausubel (1995, supra, ch. 10). 
15 A variety of commercially available kits may also be used to facilitate expression and purification of 
fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled TRICH may be achieved 
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with the 
20 T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, 35 S-methionine. 

TRICH of the present invention or fragments thereof may be used to screen for compounds 
that specifically bind to TRICH. At least one and up to a plurality of test compounds may be 
screened for specific binding to TRICH. Examples of test compounds include antibodies, 
25 oligonucleotides, proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
TRICH, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, e.g., Coligan, J.E. et «L (1991) r„m>nt Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TRICH 
30 binds, or to at least a fragment of the receptor, e.g., the Ugand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express TRICH, either as a secreted 
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Prosophila , or 
E . coli . Cells expressing TRICH or cell membrane fractions which contain TRICH are then contacted 
35 with a test compound and binding, stimulation, or inhibition of activity of either TRICH or the 
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compound is analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with TRICH, either in 
5 solution or affixed to a solid support, and detecting the binding of TRICH to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

10 TRICH of the present invention or fragments thereof may be used to screen for compounds 

that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
TRICH activity, wherein TRICH is combined with at least one test compound, and the activity of 
TRICH in the presence of a test compound is compared with the activity of TRICH in the absence of 

15 the test compound. A change in the activity of TRICH in the presence of the test compound is 
indicative of a compound that modulates the activity of TRICH. Alternatively, a test compound is 
combined with an in vitro or cell-free system comprising TRICH under conditions suitable for 
TRICH activity, and the assay is performed. In either of these assays, a test compound which 
modulates the activity of TRICH may do so indirectly and need not come in direct contact with the 

20 test compound. At least one and up to a plurality of test compounds may be screened. 

hi another embodiment, polynucleotides encoding TRICH or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem 
(ES) cells. Such techniques are well known in the art and are useful for the generation of animal 
models of human disease. (See, e.g., U.S. Patent Number 5,175,383 and U.S. Patent Number 

25 5,767,337.) For example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the 
early mouse embryo and grown in culture. The ES cells are transformed with a vector containing the 
,gene of interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; 
Capecchi, M.R. (1989) Science 244:1288-1292). The vector integrates into the corresponding region 
of the host genome by homologous recombination. Alternatively, homologous recombination takes 

30 place using the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage- 
specific manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic 
Acids Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 
blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 

35 heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
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therapeutic or toxic agents. 

Polynucleotides encoding TRICH may also be manipulated injofio in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
5 into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 

(1998) Science 282: 1 145-1 147). 

Polynucleotides encoding TRICH can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding TRICH is injected into animal ES cells, and the injected 
10 sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may also 
serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 

15 THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of TRICH and transporters and ion channels. In addition, the expression of TRICH 
is closely associated with brain, Uver, tumor, colon, thymus, small intestine, myometrium, testicular, 
bone marrow neuroblastoma tumor, parotid gland, lung, pituitary gland, and placental tissues, and 
20 Pompe's disease. Therefore, TRICH appears to play a role in transport, neurological, muscle, 
immunological, and cell proliferative disorders. In the treatment of disorders associated with 
increased TRICH expression or activity, it is desirable to decrease the expression or activity of 
TRICH. In the treatment of disorders associated with decreased TRICH expression or activity, it is 
desirable to increase the expression or activity of TRICH. 
25 Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be 

atoistered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder 
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's 
muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes insipidus, 
30 diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic 
periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia 
gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral 
neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, bmdyarrytbmia, 
tachyarrythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline 
35 myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy, 
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ethanol myopathy, dennatomyositis, inclusion body myositis, infectious myositis, polymyositis, 
neurological disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder, 
dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia, and other 
disorders associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal 
5 neuropathy, sarcoidosis, sickle cell anemia, Wilson' s disease, cataracts, infertility, pulmonary artery 
stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Grave's disease, goiter, 
Cushing's disease, Addison's disease, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital hom 
syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a 
10 neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, 
Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other 
extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive 
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other 
demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural 
15 abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous 
system disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
20 nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 

nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis, 
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 
disorders including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), 
25 akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, 
postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal 
degeneration, and familial frontotemporal dementia; a muscle disorder such as cardiomyopathy, 
myocarditis, Duchenne's muscular dystrophy, Becker's muscular dystrophy, myotonic dystrophy, 
central core disease, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial 
30 myopathy, infectious myositis, polymyositis, dennatomyositis, inclusion body myositis, thyrotoxic 
myopathy, ethanol myopathy, angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, 
Cushing's syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, 
pheochromocytoma, and myopathies including encephalopathy, epilepsy, Keams-Sayre syndrome, 
lactic acidosis, myoclonic disorder, ophthalmoplegia, and acid maltase deficiency (AMD, also known 
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TRICH may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for 
bringing a pharmaceutical agent to cells or tissues which express TRICH. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding TRICH may be administered to a subject to treat or prevent a disorder associated with 

5 increased expression or activity of TRICH including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be adininistered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 

10 combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of TRICH may be produced using methods which are generally known in the 
art. In particular, purified TRICH may be used to produce antibodies or to screen libraries of 

15 pharmaceutical agents to identify those which specifically bind TRICH. Antibodies to TRICH may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

20 For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 

and others may be immunized by injection with TRICH or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral 
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic 

25 polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in 
humans, BCG (bacilli Calmette-Guerin) and Corvnebacte rium parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 

30 fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches 
of TRICH amino acids may be fused with those of another protein, such as KLH, and antibodies to 
the chimeric molecule may be produced. 

Monoclonal antibodies to TRICH may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 

35 limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
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technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495497; Kozbor, D. et al. (1985) J. 
Immunol. Methods 81:31-42; Cote, RX et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S JP. et al. (1984) Mol. Cell Biol. 62:109-120.) 

Li addition, techniques developed for the production of "chimeric antibodies," such as the 
5 splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Nad. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce TRICH-specific single 
10 chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., 
Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
15 disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for TRICH may also be generated. 
For example, such fragments include, but are not limited to, F(ab% fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
20 the F(ab)2 fragments. Alternatively, Fab expression hbraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 
et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
25 polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between TRICH and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering TRICH epitopes is generally used, but a competitive binding assay 
may also be employed (Pound, supra) . 
30 Various methods such as Scatchard analysis in conjunction with radioimmunoassay 

techniques may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an 
association constant, K,, which is defined as the molar concentration of TRICH-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 
The K, determined for a preparation of polyclonal antibodies, which are heterogeneous in their 
35 affinities for multiple TRICH epitopes, represents the average affinity, or avidity, of the antibodies 
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for TRICH. The K, determined for a preparation of monoclonal antibodies, which are monospecific 
for a particular TRICH epitope, represents a true measure of affinity. High-affinity antibody 
preparations with K n ranging from about 10 9 to 10 12 L/mole are preferred for use in immunoassays in 
which the TRICH-antibody complex must withstand rigorous manipulations. Low-affinity antibody 
5 preparations with K, ranging from about 10 6 to 10 7 L/mole are preferred for use in 

immunopurification and similar procedures which ultimately require dissociation of TRICH, 
preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical 
A pproach. IRL Press, Washington DC; Liddell, J.E. and A. Oyer (1991) A Practical Guide to 
Monoclonal Antibodies . John Wiley & Sons, New York NY). 
10 The titer and avidity of polyclonal antibody preparations may be further evaluated to 

determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation 
of TRICH-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and 
15 guidelines for antibody quality and usage in various applications, are generally available. (See, e.g., 
Catty, supra , and Coligan et al. supra .) 

m another embodiment of the invention', the polynucleotides encoding TRICH, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, modifications 
of gene expression can be achieved by designing complementary sequences or antisense molecules 
20 (DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene 
encoding TRICH. Such technology is well known inthe art, and antisense oligonucleotides or larger 
fragments can be designed from various locations along the coding or control regions of sequences 
encoding TRICH. (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics, Humana Press Inc., 
TotawaNJ.) 

25 In therapeutic use, any gene delivery system suitable for introduction of the antisense 

sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intraceUularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 

30 9(13): 1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubel, supra; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et 

35 al. (1998) J. Pharm. Sci. 87(1 1): 1308-13 15; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
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25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding TRICH may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) conect a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCE»-X1 disease characterized by X- 
5 linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:47(M75), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum Gene Therapy 6:667-703), thalassamias, familial 
10 hypercholesterolemia, and hemophilia resulting from Factor Vm or Factor K deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites 
(e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. 
15 (1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA. 93:11395-11399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis ; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi) . In the 
case where a genetic deficiency in TRICH expression or regulation causes disease, the expression of 
TRICH from an appropriate population of transduced cells may alleviate the clinical manifestations 
20 caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing 
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
25 ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 
Biochem. 62:191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H Recipon (1998) Curr. 
Opin. Biotechnol. 9:445^150). 

Expression vectors that may be effective for the expression of TRICH include, but are not 
30 limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 

(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), 
and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TRICH 
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), 
Rous sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or 0-actin genes), (ii) an inducible 
35 promoter (e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. 
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Acad. Sci. USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and 
H.M. Blau (1998) Curr. Opin. Biotechnol. 9:45 M56), commercially available in the T-REX plasmid 
(Invitrogen)); the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; 
Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter 
5 (Rossi, F.M.V. and Blau, H.M. supra) ), or (iii) a tissue-specific promoter or the native promoter of the 
endogenous gene encoding TRICH from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSACTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
10 parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, FX. and A. J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
15 respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i). the 
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus cw-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
20 commercially available (Stratagene) and are based on published data (Riviere, L et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, MA. et al. (1987) J. Virol. 61:1639-1646; Adam, MA. and 
25 AD. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) I. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 
discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T- 
30 cells), and the return of transduced cells to a patient are procedures well known to persons skilled in 
the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020- 
7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:47074716; 
Ranga, U. et al. (1998) Proc. Nad. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283- 
2290). 

35 In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
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polynucleotides encoding TRICH to cells which have one or more genetic abnormalities with respect 
to the expression of TRICH. The construction and packaging of adenovirus-based vectors are well 
known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) 
Annu. Rev. Nutr. 19:511-544 and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with 
respect to the expression of TRICH. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing TRICH to cells of the central nervous system, for which HSV has 
a tropism The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art A replication^ompetent herpes simplex virus (HSV) type 1-based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant 
HSV d92 which consists of a genome containing at least one exogenous gene to be transferred to a 
cell under the control of the appropriate promoter for purposes including human gene therapy. Also 
taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 
and ICP22. For HSV vectors, see also Coins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. 
(1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned 
herpesvirus sequences, the generation of recombinant virus following the transfection of multiple 
plasmids containing different segments of the large herpesvirus genomes, the growth and propagation 
of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of 

ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464^69). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, 
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity 
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(e.g., protease and polymerase). Similarly, inserting the coding sequence for TRICH into the 
alphavirus genome in place of the capsid-codihg region results in the production of a large number of 
TRICH-coding RNAs and the synthesis of high levels of TRICH in vector transduced cells. While 
alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a 

5 persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) 
indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy 
application (Dryga, S.A. et.al. (1997) Virology 228:74-83). The wide host range of alphaviruses will 
allow the introduction of TRICH into a variety of cell types. The specific transduction of a subset of 
cells in a population may require the sorting of cells prior to transduction. The methods of 

10 manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 

transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 
art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, 
15 inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful 
because it causes inhibition of the ability of the double helix to open sufficiently for the binding of 
polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using 
triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. 
and B.I. Carr, Molecular and Immunologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 163- 
20 177.) A complementary sequence or antisense molecule may also be designed to block translation of 
mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
25 engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding TRICH. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
30 corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
. oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
35 by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
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for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and mviyo transcription of DNA 
sequences encoding TRICH. Such DNA sequences may be incorporated into a wide variety of 
vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 
constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into 

cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule, or the use of phosphorotbioate or T O-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, 
queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TRICH. 
Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helk-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 
promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
TRICH expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding TRICH may be therapeutically useful, and in the treatment of disorders 
associated with decreased TRICH expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding TRICH may be therapeutically useful. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of natoally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an invitro cell-free or reconstituted 
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biochemical system. Alterations in the expression of a polynucleotide encoding TRICH are assayed 
by any method commonly known in the art. Typically, the expression of a specific nucleotide is 
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding TRICH. The amount of hybridization may be quantified, thus 

5 forming the basis for a comparison of the expression of the polynucleotide both with and without 
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 
exposed to a test compound indicates that the test compound is effective in altering the expression of 
the polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, using a Srhizo^ccharomyces pombe gene expression 

10 system (Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids 
Res. 28-.E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. 
Res. Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oUgonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 

15 sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) US. 
Patent No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo , in vitro, and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient 
20 Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 

Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
25 monkeys. 

An additional embodiment of the invention relates to the administration of a composition 
which generally comprises an active ingredient formulated with a pharmaceutical^ acceptable 
excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. 
Various formulations are commonly known and are thoroughly discussed in the latest edition of 

30 p^rtm* Pharm ^-ntical Sciences (Maack Publishing, Easton PA). Such compositions may 

consist of TRICH, antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of TRICH. 

The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, 

35 enteral, topical, sublingual, or rectal means. 
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Compositions for pulmonary administration may be prepared in liquid or dry powder form 
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of 
fast-acting formulations is well-known in the art In the case of macromolecules (e.g. larger peptides 
and proteins), recent developments in the field of pulmonary delivery via the alveolar region of the 
lung have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, 
J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of administration 
without needle injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination 
of an effective dose is well within the capability of those skilled in the art 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising TPJCH or fragments thereof. For example, liposome preparations 
containing a ceU-impermeable macromolecule may promote cell fusion and intracellular delivery of 
the macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic N- 
tenninal portion from the EHV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, 
monkeys, or pigs. An animal model may also be used to determine the appropriate concentration 
range and route of administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
TRICH or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of TRICH, 
which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined 
by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED 50 (the dose therapeutically effective in 50% of the population) or IX> 50 (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LD s0 /EDj 0 ratio. Compositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that includes the ED*, with Utile or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient and the route of administration. 
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The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and ad.ninistnttion are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combinations), reaction sensitivities, and 
response to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, 
or biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 fig to 100,000 //g, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind TRICH may be used for the 
diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being 
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for TRICH include methods which utilize the antibody and a label to detect TRICH 
in human body fluids or in extracts of cells or tissues. The antibodies may be used with or without 
modification, and may be labeled by covalent or nonpatent attachment of a reporter molecule. A 
wide variety of reporter molecules, several of which are described above, are known in the art and 
may be used. 

A variety of protocols for measuring TRICH, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression. Normal 
or standard values for TRICH expression are established by combining body fluids or cell extracts 
taken from normal mammalian subjects, for example, human subjects, with antibodies to TRICH 
under conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of TRICH expressed in 
subject, control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TRICH may be used 
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of TRICH may be correlated 



59 



WO 02/12340 



PCTAJSOl/24217 



with disease. The diagnostic assay may be used to determine absence, presence, and excess 
expression of TRICH, and to monitor regulation of TRICH levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding TRICH or closely related molecules may be used 
5 to identify nucleic acid sequences which encode TRICH. The specificity of the probe, whether it is 
made from a highly specific region, e.g., the 5'regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding TRICH, allelic variants, or related 
sequences. 

10 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:31-60 or from 
genomic sequences including promoters, enhancers, and introns of the TRICH gene. 

Means for producing specific hybridization probes for DNAs encoding TRICH include the 
15 cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the 

production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes invitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 
20 such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 
Polynucleotide sequences encoding TRICH may be used for the diagnosis of disorders 
associated with expression of TRICH. Examples of such disorders include, but are not limited to, a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, 
Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes meUitus, diabetes 
25 insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, 

normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, 
myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral 
neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., 
angina, bradyarrythmia, tachyarrythmia, hypertension, Long QT syndrome, myocarditis, 
30 cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial 
myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis, 
infectious myositis, polymyositis, neurological disorders associated with transport, e.g., Alzheimer's 
disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, paranoid 
psychoses, and schizophrenia, and other disorders associated with transport, e.g., neurofibromatosis, 
35 postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia Wilson's disease, 
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cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia, 
hypoglycemia, Grave's disease, goiter, Cushing's disease, Addison's disease, glucose-galactose 
malabsorption syndrome, hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, 
Menkes disease, occipital horn syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup 
5 disease, and Fanconi disease; a neurological disorder such as epilepsy, ischemic cerebrovascular 
disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, 
dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and 
other motor neuron disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary 
ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain 
10 abscess, subdural empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and 
radiculitis, viral central nervous system disease, prion diseases including kuru, Creutzfeldt-Jakob 
disease, and Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and 
metabolic diseases of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal 
hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation and other developmental 
15 disorders of the central nervous system including Down syndrome, cerebral palsy, neuroskeletal 
disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord diseases, 
muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, 
myasthenia gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic 
20 disorders, seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, 
tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, 
progressive supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a 
muscle disorder such as cardiomyopathy, myocarditis, Duchenne's muscular dystrophy, Becker's 
muscular dystrophy, myotonic dystrophy, central core disease, nemaline myopathy, centronuclear 
25 myopathy, lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis, 

dermatomyositis, inclusion body myositis, thyrotoxic myopathy, ethanol myopathy, angina, 
anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's syndrome, hypertension, 
hypoglycemia, myocardial infarction, migraine, pheochromocytoma, and myopathies including 
encephalopathy, epilepsy, Keams-Sayre syndrome, lactic acidosis, myoclonic disorder, 
30 ophthalmoplegia, and acid maltose deficiency (AMD, also known as Pompe's disease); an 

immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, 
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
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dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
5 myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 
psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 
anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and a 
10 cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
15 bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. The polynucleotide 
sequences encoding TRICH may be used in Southern or northern analysis, dot blot, or other 
membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like 
assays; and in microarrays utilizing fluids or tissues from patients to detect altered TRICH 
20 expression. Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding TRICH may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TRICH may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a 
25 suitable incubation period, the sample is washed and the signal is quantified and compared with a 
standard value. If the amount of signal in the patient sample is significantly altered in comparison to 
a control sample then the presence of altered levels of nucleotide sequences encoding TRICH in the 
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate 
the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to 
30 monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
TRICH, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding TRICH, under conditions suitable for hybridization or 
35 amplification. Standard hybridization may be quantified by comparing the values obtained from 
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normal subjects with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with values 
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 
to employ preventative measures or aggressive treatment earlier thereby preventing the development 

15 or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
TRICH may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced invitrg. Oligomers will preferably contain a fragment of a polynucleotide 
encoding TRICH, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
20 TRICH, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding TRICH may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic 
disease in humans. Methods of SNP detection include, but are not limited to, single-stranded 
conformation polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, 
oligonucleotide primers derived from the polynucleotide sequences encoding TRICH are used to 
amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 
30 fiomdiseasedornormdtissue,biopsysamples,bodUyflmds,andthelike. SNPs in the DNA cause 
differences in the secondary and tertiary structures of PCR products in single-stranded form, and 
these differences are detectable using gel electrophoresis in non-denaturing gels. In fiSCCP, the 
oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high- 
throughput equipment such as DNA sequencing machines. Additionally, sequence database analysis 
35 methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by comparing the 
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sequence of individual overlapping DNA fragments which assemble into a common consensus 
sequence. These computer-based methods filter out sequence variations due to laboratory preparation 
of DNA and sequencing errors using statistical models and automated analyses of DNA sequence 
chromatograms. In the alternative, SNPs may be detected and characterized by mass spectrometry 

5 using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of TRICH include radiolabeling 
or biotinylating nucleotides, ^amplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 

10 accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives 
rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The microarray 

15 can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described below. The microarray may also be used to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 
function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor the 

20 activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and 
effective treatment regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient based on his/her 
phannacogenomic profile. 

25 In another embodiment, TRICH, fragments of TRICH, or antibodies specific for TRICH may 

be used as elements on a microarray. The microarray may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 

30 gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at 
a given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent Number 
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
35 transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
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hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, 
biopsies, or other biological samples. The transcript image may thus reflect gene expression inviyo, 
as in the case of a tissue or biopsy sample, or in vitro , as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present 
invention may also be used in conjunction with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as lexicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and 
toxicity (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson 
(2000) Toxicol. Lett. 112-113:467-471, expressly incorporated by reference herein). If a test 
compound has a signature similar to that of a compound with known toxicity, it is likely to share 
those toxic properties. These fingerprints or signatures are most useful and refined when they contain 
expression information from a large number of genes and gene families. Ideally, a genome-wide 
measurement of expression provides the highest quality signature. Even genes whose expression is 
not altered by any tested compounds are important as well, as the levels of expression of these genes 
are used to normalize the rest of the expression data. The normalization procedure is useful for 
comparison of expression data after treatment with different compounds. While the assignment of 
gene function to elements of a toxicant signature aids in interpretation of toxicity mechanisms, 
knowledge of gene function is not necessary for the statistical matching of signatures which leads to 
prediction of toxicity. (See, for example, Press Release 00-02 from the National Institute of 
Environmental Health Sciences, released February 29, 2000, available at 
http://www.niehs.nih.gov/oc/news/toxchip.htm) Therefore, it is important and desirable in 
toxicological screening using toxicant signatures to include all expressed gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of 
the present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global 
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pattern of protein expression in a particular tissue or cell type. Each protein component of a 
proteome can be subjected individuaUy to further analysis. Proteome expression patterns, or profiles, 
are analyzed by quantifying the number of expressed proteins and their relative abundance under 
given conditions and at a given time. A profile of a cell's proteome may thus be generated by 
5 separatmgandanalyzmgmepolypeptidesofaparticmartissueorcelltype. In one embodiment, the 
separation is achieved using twonlimensional gel electrophoresis, in which proteins from a sample are 
separated by isoelectric focusing in the first dimension, and then according to molecular weight by 
sodium dodecyl sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, 
supra). The proteins are visualized in the gel as discrete and uniquely positioned spots, typically by 
10 staining the gel with an agent such as Coomassie Blue or silver or fluorescent stains. The optical 
density of each protein spot is generally proportional to the level of the protein in the sample. The 
optical densities of equivalendy positioned protein spots from different samples, for example, from 
biological samples either treated or untreated with a test compound or therapeutic agent, are 
compared to identify any changes in protein spot density related to the treatment. The proteins in the 
15 spots are partially sequenced using, for example, standard methods employing chemical or enzymatic 
cleavage f ollowed by mass spectrometry. The identity of the protein in a spot may be determined by 
comparing its partial sequence, preferably of at least 5 contiguous amino acid residues, to the 
polypeptide sequences, of the present invention. In some cases, further sequence data may be 
obtained for definitive protein identification. 
20 A proteomic profile may also be generated using antibodies specific for TRICH to quantify 

the levels of TRICH expression. In one embodiment, the antibodies are used as elements on a 
microarray, and protein expression levels are quantified by exposing the microarray to the sample and 
detecting the levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem 
270:103-111; Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed 
25 by a variety of methods known in the art, for example, by reacting the proteins in the sample with a 
thiol- or amino-reactive fluorescent compound and detecting the amount of fluorescence bound at 
each array element. 

Toxicant signatures at the proteome level are also useful for lexicological screening, and 
should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor 

30 correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and I. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be 
useful in the analysis of compounds which do not significantly affect the transcript image, but which 
alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to 
rapid degradation of mRNA, so proteomic profiling may be more reliable and informative in such 

35 cases. 
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In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated 
biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to the amount of the corresponding protein in an untreated biological 

5 sample. A difference in the amount of protein between the two samples is indicative of a toxic 

response to the test compound in the treated sample. Individual proteins are identified by sequencing 
the amino acid residues of the individual proteins and comparing these partial sequences to the 
polypeptides of the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 

10 sample containing proteins with the test compound. Proteins from the biological sample are 
incubated with antibodies specific to the polypeptides of the present invention. The amount of 
protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
sample is compared with the amount in an untreated biological sample. A difference in the amount of 
protein between the two samples is indicative of a toxic response to the test compound in the treated 

15 sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
20 2155; and Heller, M J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are 
well known and thoroughly described in PNA Microarrays: A Practical Approach , M. Schena, ed. 
(1999) Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding TRICH may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
25 Either coding or noncoding sequences may be used, and in some instances, noncoding sequences may 
be preferable over coding sequences. For example, conservation of a coding sequence among 
- members of a multi-gene family may potentially cause undesired cross hybridization during 
chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial 
30 chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes 

(BACs), bacterial PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, JJ. 
et al. (1997) Nat. Genet 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. 
(1991) Trends Genet. 7:149-154.) Once mapped, the nucleic acid sequences of the invention may be 
used to develop genetic linkage maps, for example, which correlate the inheritance of a disease state 
35 with the inheritance of a particular chromosome region or restriction fragment length polymorphism 
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(RFLP). (See, for example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353- 
7357.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
mapdata. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968.) Examples of genetic 

5 map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMM) World Wide Web site. Correlation between the location of the gene encoding TRICH on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 

10 linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 

15 localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to llq22-23, 
any sequences mapping to that area may represent associated or regulatory genes for further 
investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of 
the instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 

20 In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or 

oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be fiee in solution, affixed to a 
solid support, borne on a cell surface, or located intraceUularly. The formation of binding complexes 
between TRICH and the agent being tested may be measured. 

25 Another technique for drug screening provides for high throughput screening of compounds 

having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with TRICH or fragments thereof, 
and washed. Bound TRICH is then detected by methods well known in the art. Purified TRICH can 

30 also be coated directly onto plates for use in the aforementioned drug screening techniques. 

Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding TRICH specifically compete with a test compound for binding TRICH. 
35 In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
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antigenic determinants with TRICH. 

In additional embodiments, the nucleotide sequences which encode TRICH may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
5 properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following embodiments are, 
therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure 
in any way whatsoever. 

10 The disclosures of all patents, applications, and publications mentioned above and below, 

including U.S. Ser. No. 60/223,269, U.S. Ser. No. 60/224,456, U.S. Ser. No. 60/226,410, U.S. Ser. 
No. 60/228,140, U.S. Ser. No. 60/230,067, and U.S. Ser. No. 60/231,434, are hereby expressly 
incorporated by reference. 
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EXAMPLES 

L Construction of cDNA Libraries 

Ihcyte cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD database 
(Incyte Genomics, Palo Alto CA) and shown in Table 4, column 5. Some tissues were homogenized 
and lysed in guanidinium isothiocyanate. while others were homogenized and lysed in phenol or in a 
suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of 
phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or 
extracted with chloroform RNA was precipitated from the lysates with either isopropanol or sodium 
acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
25 purity. In some cases, RNA was treated with DNase. For most libraries, poly(A) + RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). .Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 
30 In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
35 oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
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appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300- 
1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography ( Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
5 PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1 plasmid 
(Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), or pINCY (fccyte Genomics, Palo Alto 
CA), or derivatives thereof. Recombinant plasmids were transformed into competent RcpU cells 
including XLl-Blue, XLl-BlueMRF, or SOLR from Stratagene or DH5o, DH10B, or ElectroMAX 
DH10B from Life Technologies. 
10 II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by inviyo 
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
15 QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 
plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 
ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
20 cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN H fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 
HI. Sequencing and Analysis 
25 Incyte cDNA recovered in plasmids as described in Example H were sequenced as follows. 

Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal 
cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 
MICROLAB 2200 (Hamilton) liquid transfer system cDNA sequencing reactions were prepared 
30 using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the 
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
35 protocols and base calling software; or other sequence analysis systems known in the art. Reading 
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frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 
1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques 

disclosed in Example VUL 

The polynucleotide sequences derived from Incyte cDNAs were validated by removing 
5 vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 

programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The 
Incyte cDNA sequences or translations thereof were then queried against a selection of public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and 
BLOCKS, PRINTS, DOMO, PRODOM, and hidden Markov model (HMM)-based protein family 
10 databases such as PFAM. (HMM is a probabilistic approach which analyzes consensus primary 

structures of gene families. See, for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 
The queries were performed using programs based on BLAST, FASTA BLIMPS, and HMMER. The 
Incyte cDNA sequences were assembled to produce full length polynucleotide sequences. 
Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched sequences, or 
15 Genscan-predicted coding sequences (see Examples IV and V) were used to extend Incyte cDNA 
assemblages to full length. Assembly was performed using programs based on Phred, Phrap, and 
Consed, and cDNA assemblages were screened for open reading frames using programs based on 
GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated to derive 
the corresponding full length polypeptide sequences. Alternatively, a polypeptide of the invention 
20 ,naybegmatanyofmemetmonmeresiduesofto Full length 

polypeptide sequences were subsequently analyzed by querying against databases such as the 
GenBank protein databases (genpept), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 
and hidden Markov model (HMM)-based protein family databases such as PFAM. Full length 
polynucleotide sequences are also analyzed using MACDNASIS PRO software (Hitachi Software 
25 Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide 
and polypeptide sequence alignments are generated using default parameters specified by the 
CLUSTAL algorithm as incorporated into the MEGALIGN multisequence alignment program 
(DNASTAR), which also calculates the percent identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
30 Incyte cDNA and full length sequences and provides applicable descriptions, references, and 

threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used, 
the second column provides brief descriptions thereof, the third column presents appropriate 
references, all of which are incorporated by reference herein in their entirety, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate the 
35 strength of a match between two sequences (the higher the score or the lower the probability value, 
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the greater the identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide 
and polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ 
ID NO.31-60. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
5 and amplification technologies are described in Table 4, column 4. 

IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative transporters and ion channels were initially identified by running the Genscan gene 
identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is 
a general-purpose gene identification program which analyzes genomic DNA sequences from a 
10 variety of organisms (See Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94, and Burge, C. and 
S. Karlin (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to 
form an assembled cDNA sequence extending from a methionine to a stop codon. The output of 
Genscan is a FASTA database of polynucleotide and polypeptide sequences. The maximum range of 
sequence for Genscan to analyze at once was set to 30 kb. To determine which of these Genscan 
15 predicted cDNA sequences encode transporters and ion channels, the encoded polypeptides were 
analyzed by querying against PFAM models for transporters and ion channels. Potential transporters 
and ion channels were also identified by homology to Incyte cDNA sequences that had been 
annotated as transporters and ion channels. These selected Genscan-predicted sequences were then 
compared by BLAST analysis to the genpept and gbpri public databases. Where necessary, the 
20 Genscan-predicted sequences were then edited by comparison to the top BLAST hit from genpept to 
correct errors in the sequence predicted by Genscan, such as extra or omitted exons. BLAST analysis 
was also used to find any Incyte cDNA or public cDNA coverage of the Genscan-predicted 
sequences, thus providing evidence for transcription. When Incyte cDNA coverage was available, 
this information was used to correct or confirm the Genscan predicted sequence. FuUlength 
25 polynucleotide sequences were obtained by assembling Genscan-predicted coding sequences with 
Incyte cDNA sequences and/or public cDNA sequences using the assembly process described in 
Example BX Alternatively, full length polynucleotide sequences were derived entirely from edited or 
unedited Genscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 

30 "SrttphMl" Seq uences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene 
identification program described in Example IV. Partial cDNAs assembled as described in Example 
D3 were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm 

35 based on graph theory and dynamic programming to integrate cDNA and genomic information, 
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generating possible splice variants that were subsequently confirmed, edited, or extended to create a 
full length sequence. Sequence intervals in which the entire length of the interval was present on 
more than one sequence in the cluster were identified, and intervals thus identified were considered to 
be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic 
sequences, then all three intervals were considered to be equivalent This process allows unrelated 
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals 
thus identified were then "stitched" together by the stitching algorithm in the order that they appear 
along their parent sequences to generate the longest possible sequence, as well as sequence variants. 
Linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or 
" genomic sequence to genomic sequence) were given preference over linkages which change parent 
type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared 
by BLAST analysis to the genpept and gbpri public databases. Incorrect exons predicted by Genscan 
were corrected by comparison to the top BLAST hit from genpept Sequences were further extended 
with additional cDNA sequences, or by inspection of genomic DNA when necessary. 
"Stretched" Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 
analysis. First, partial cDNAs assembled as described in Example ffl were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program The nearest GenBank protein homolog was then compared by BLAST 
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 
(HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 
GenBank protein homolog, the chimeric protein, or both were used as probes to search for 
homologous genomic sequences from the public human genome databases. Partial DNA sequences 
were therefore "stretched" or extended by the addition of homologous genomic sequences. The 
resultant stretched sequences were examined to determine whether it contained a complete gene. 
VI. Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:31-60 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm Sequences from these databases that matched 
SEQ ID NO:31-60 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
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bad been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome' s p- 
5 arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 
humans, although this can vary widely due to hot and cold spots of recombination.) The cM 
distances are based on genetic markers mapped by Gen6thon which provide boundaries for radiation 
hybrid markers whose sequences were included in each of the clusters. Human genome maps and 
10 other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 
(htt P 7/ww.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
15 gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel 

(1995) supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
20 much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the 

computer search can be modified to determine whether any particular match is categorized as exact or 
similar. The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 



25 5 x minimum {length(Seq. 1), length(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
30 product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). K mere is more man one HSP, then me pan 

the product score. The product score represents a balance between fractional overlap and quality in a 
35 BLAST ahgnrnent. For example, a product score of 100 is produced only for 100% identity over the 
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entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 

identity and 100% overlap. 

Alternatively, polynucleotide sequences encoding TRICH are analyzed with respect to the 
tissue sources from which they were derived. For example, some full length sequences are 
assembled, at least in part, with overlapping Incyte cDNA sequences (see Example ID). Each cDN A 
sequence is derived from a cDNA library constructed from a human tissue. Each human tissue is 
classified into one of the following organ/tissue categories: cardiovascular system; connective tissue; 
digestive system; embryonic structures; endocrine system; exocrine glands; genitalia, female; 
genitalia, male; germ cells; hemic and immune system; liver; musculoskeletal system; nervous 
system; pancreas; respiratory system; sense organs; skin; stomatognathic system; unclassified/mixed; 
or urinary tract. The number of libraries in each category is counted and divided by the total number 
of libraries across all categories. Similarly, each human tissue is classified into one of the following 
i disease/condition categories: cancer, cell line, developmental, inflammation, neurological, trauma, 
cardiovascular, pooled, and other, and the number of libraries in each category is counted and divided 
by the total number of libraries across all categories. The resulting percentages reflect the tissue- and 
disease-specific expression of cDNA encoding TRICH. cDNA sequences and cDNA library/tissue 
information are found in the LIFESEQ GOLD database (Incyte Genomics, Palo Alto CA). 
3 Vm. Extension of TRICH Encoding Polynucleotides 

Full length polynucleotide sequences were also produced by extension of an appropriate 
fragment of the full length molecule using oligonucleotide primers designed from this fragment. One 
primer was synthesized to initiate 5' extension of the known fragment, and the other primer was 
synthesized to initiate 3' extension of the known fragment. The initial primers were designed using 
5 OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 
sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
30 extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 mnol of each primer, reaction buffer containing Mg* (NH») 2 S0 4 , 
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), BLONGASE enzyme 
35 (Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 
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pair PQ A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 

5 Step 6: 68°C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 \i\ PICOGREBN 
quantitation reagent (0.25% (v/v) PICOGREBN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 pi of undiluted PCR product into each well of an opaque fluorimeter plate (Coming Costar, 
Acton MA), allowing the DNA to bind to the reagent The plate was scanned in a Fluoroskan H 

10 (Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 iA to 10 (A aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the 
sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 

15 digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 

20 Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site 
overhangs, and transfected into competent Ecoli cells. Transformed cells were selected on 
antibiotic^ontaining media, and individual colonies were picked and cultured overnight at 37 °C in 
384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 

25 (Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 

parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recoveries were reamplified using the same conditions as described above. Samples were diluted 

30 with 20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing 
primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

In like manner, full length polynucleotide sequences are verified using the above procedure or 
are used to obtain 5' regulatory sequences using the above procedure along with oligonucleotides 
35 designed for such extension, and an appropriate genomic library. 



76 



WO 02/12340 



PCT7US01/24217 



10 



IX. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO.31-60 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 /iCi of 
[ Y - 32 P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane-based 
hybridization analysis of human genomic DNA digested with one of the following endonucleases: 
Ase I, Bgl D, Eco RI, Pst I, Xba L or Pvu H (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
15 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

X. Microarrays 

20 The linkage or synthesis of array elements upon a microarray can be achieved utilizing 

photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra.), 
mechanical microspomng technologies, and derivatives thereof. The substrate in each of the 
aforementioned technologies should be uniform and solid with a non-porous surface (Schena (1999), 
supra). Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. 

25 Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and link 
elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical array may be produced using available methods and machines well known to 
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 

30 Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 

35 biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
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After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element 
on the microarray may be assessed. In one embodiment, microarray preparation and usage is 
described in detail below. 
Tissue or Q» Sample Pr eparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 ptfd oligo-(dT) primer (21mer), IX 
first strand buffer, 0.03 units/ M l RNase inhibitor, 500 fM dATP, 500 »M dGTP, 500 fiA dTTP, 40 
fM dCTP, 40 fiM dCTP-Cy3 (BDS) or dCTP-CyS (Amersham Phannacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 
GEMB RIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by invitro transcription 
from non-coding yeast genomic DNA. After incubation at 37° C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85° C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 /d 5X SSC/0.2% SDS. 
Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element 
is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification 
uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
«. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Coming) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, 
and coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 
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110°Coven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 fd of the array element DNA, at an average 
concentration of 100 ng/ M l, is loaded into the open capillary printing element by a high-speed robotic 

5 apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60" C followed by washes in 

10 0.2% SDS and distilled water as before. 
Hybridisation 

Hybridization reactions contain 9 ;d of sample mixture consisting of 0.2 fig each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered 

15 wimanlWcoverslip. The arrays are transferred to a waterproof chamber having a cavity just 
slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the 
addition of 140 /il of 5X SSC in a comer of the chamber. The chamber containing the arrays is 
incubated for about 6.5 hours at 60° C. The arrays are washed for 10 min at 45" C in a first wash 
buffer (IX SSC, 0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X 

20 SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5 . The excitation laser tight is 
25 focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 

resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
30 Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
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although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
5 locationtobecoirelatedwithaweightratioofhybridizingspeciesof 1:100,000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are 
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with the 
two fluorophores and adding identical amounts of each to the hybridization mixture. 
10 The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 

(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
15 measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 
integrated to obtain a numerical value corresponding to the average intensity of the signal. The 
20 software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
XL Complementary Polynucleotides 

Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring TRICH. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
25 procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of TRICH. To 
inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' sequence 
and used to prevent promoter binding to the coding sequence. To inhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to the TRICH-encoding 
30 transcript. 

XII. Expression of TRICH 

Expression and purification of TRICH is achieved using bacterial or virus-based expression 
systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
35 transcription. Examples of such promoters include, but are not limited to, the trp-lac (toe) hybrid 
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promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express TRICH upon induction with isopropyl beta-D- • 
thiogalactopyranoside (JPTG). Expression of TRICH in eukaryotic cells is achieved by infecting 

5 insect or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding TRICH by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 

10 infect Spodo ptera frueiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. 
et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 
7:1937-1945.) 

In most expression systems, TRICH is synthesized as a fusion protein with, e.g., glutathione 

15 S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26- 
kilodalton enzyme from Schistosoma iaponicum . enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 

20 TRICH at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity 

purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate 
resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, 
supra , ch. 10 and 16). Purified TRICH obtained by these methods can be used direcdy in the assays 

25 shown in Examples XVI, XVIL and XVIII, where applicable. 
Xm. Functional Assays 

TRICH function is assessed by expressing the sequences encoding TRICH at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 

30 include PCMV SPORT (Life Technologies) and PCR3.1 (Invitrogen, Carlsbad CA), both of which 
contain the cytomegalovirus promoter. 5-10 jug of recombinant vector are transiently transfected into 
a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
formulations or electroporation. 1-2 ng of an additional plasmid containing sequences encoding a 
marker protein are co-transfected. Expression of a marker protein provides a means to distinguish 
35 transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
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recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 
Clontech), CD64, or a CD64-GFP fusion protein. How cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate 
the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 
5 fluorescent molecules that diagnose events preceding or coincident with cell death. These events 
include changes in nuclear DNA content as measured by staining of DNA with propidium iodide; 
changes in cell size and granularity as measured by forward light scatter and 90 degree side light 
scatter, down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; 
alterations in expression of cell surface and intracellular proteins as measured by reactivity with 
10 specific antibodies; and alterations in plasma membrane composition as measured by the binding of 
fluorescein^onjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M.G. (1994) Flow Cytometry, Oxford, New York NY. 

The influence of TRICH on gene expression can be assessed using highly purified 
populations of cells transfected with sequences encoding TRICH and either CD64 or CD64-GFP. 
15 CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions 
of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected 
cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake 
Success NY). mRNA can be purified from the cells using methods well known by those of skill in 
the art. Expression of mRNA encoding TRICH and other genes of interest can be analyzed by 
20 northern analysis or microarray techniques. 

XIV. Production of TRICH Specific Antibodies 

TRICH substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-195), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 
25 Alternatively, the TRICH amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.) 
30 Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 43 1 A 

peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 
Aldrich, St. Louis MO) by reaction with N-naaleiimdobenzoyl-N-hydroxysuccinimide ester (MBS) to 
increase immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the 
oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for 
35 antipeptide and anti-TRICH activity by, for example, binding the peptide or TRICH to a substrate, 
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blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat 
anti-rabbit IgG. 

XV. Purification of Naturally Occurring TRICH Using Specific Antibodies 

Naturally occurring or recombinant TRICH is substantially purified by immunoaffinity 
5 chromatography using antibodies specific for TRICH. An immunoaffinity column is constructed by 
covalently coupling anti-TRICH antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing TRICH are passed over the immunoaffinity column, and the column is 
10 washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and TRICH is collected. 
XVI. Identification of Molecules Which Interact with TRICH 
15 Molecules which interact with TRICH may include transporter substrates, agonists or 

antagonists, modulatory proteins such as Gfr proteins (Reimann, supra) or proteins involved in 
TRICH localization or clustering such as MAGUKs (Craven, supra). TRICH, or biologically active 
fragments thereof, are labeled with 1JS I Bolton-Hunter reagent. (See, e.g., Bolton A.E. and W.M. 
Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules previously arrayed in the wells of a 
20 multi-well plate are incubated with the labeled TRICH, washed, and any wells with labeled TRICH 
complex are assayed. Data obtained using different concentrations of TRICH are used to calculate 
values for the number, affinity, and association of TRICH with the candidate molecules. 

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system 
(Fields, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragments thereof, are expressed as 
25 fusion proteins with the DNA binding domain of Gal4 or lexA, and potential interacting proteins are 
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion 
protein and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute a 
transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems are 
commercially available, and methods for use of the yeast 2-hybrid system with ion channel proteins 
30 are discussed in Niethammer, M. and M. Sheng (1998, Meth. Enzymol. 293: 104-122). 

TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
. which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. 
Patent No. 6,057,101). 

35 Potential TRICH agonists or antagonists may be tested for activation or inhibition of TRICH 
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ion channel activity using the assays described in section XVIII. 
XVII. Demonstration of TRICH Activity 

Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion 
conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa 
5 or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are 
commercially available, and the techniques to introduce them into cells are well known to those 
skilled in the art. A second plasmid which expresses any one of a number of marker genes, such as B- 
galactosidase, is co-transformed into the cells to allow rapid identification of those cells which have 
taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after 
10 transformation under conditions appropriate for the cell line to allow expression and accumulation of 
TRICH and 6-galactosidase. 

Transformed cells expressing 6-galactosidase are stained blue when a suitable colorimetric 
substrate is added to the culture media under conditions that are well known in the art. Stained cells 
arc tested for differences in membrane conductance by electrophysiological techniques that are well 
15 known in the art Untransformed cells, and/or cells transformed with either vector sequences alone or 
6-galactosidase sequences alone, are used as controls and tested in parallel. Cells expressing TRICH 
will have higher anion or cation conductance relative to control cells. The contribution of TRICH to 
conductance can be confirmed by incubating the cells using antibodies specific for TRICH The 
antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in the ion channel, 
20 and the associated conductance. 

Alternatively, ion channel activity of TRICH is measured as current flow across a TRICH- 
containing Xenonus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishi et 
al., supra; Jegla, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an 
appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into 
25 mature stage IV oocytes. Injected oocytes are incubated at 18 °C for 1-5 days. Inside-out 

macropatches are excised into an intracellular solution containing 116 mM K-gluconate, 4 mM KC1, 
and 10 mM Hepes (pH 7.2). The intracellular solution is supplemented with varying concentrations 
of the TRICH mediator, such as cAMP, cGMP, or Ca +2 (in the form of CaCU where appropriate. 
Electrode resistance is set at 2-5 Mfl and electrodes are filled with the intracellular solution lacking 
30 mediator. Experiments are performed at room temperature from a holding potential of 0 mV. 

Voltage ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. Current 
measured is proportional to the activity of TRICH in the assay. 

In particular, the activity of TRICH-20 is measured as Ca 2+ conductance, the activity of 
TRICH-22 is measured as Q- conductance in the presence of glycine, the activity of TRICH-23 is 
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measured as Ca 2+ conductance, and the activity of TRICH-24 is measured as K + conductance in the 
presence of Ca 2+ , and the activity of TRICH-26 is measured as cation conductance in the presence of 
heat. 

• Transport activity of TRICH is assayed by measuring uptake of labeled substrates substrates 
5 (including but not limited to, maltose, glucose, or glycogen) into Xenopus laevis oocytes. Oocytesat 
stages V and VI are injected with TRICH mRNA (10 ng per oocyte) and incubated for 3 days at 18°C 
in OR2 medium (82.5mMNaCl, 2.5 mMKCl, lmMCaCl* lmMMgCl 2 , ImM N a2 HP0 4 , 5 mM 
Hepes, 3.8 mM NaOH , 50/ig/ml gentamycin, pH 7.8) to allow expression of TRICE Oocytes are 
then transferred to standard uptake medium (lOOmM NaCl, 2 mM KC1, ImM CaCl 2 , ImM MgCl 2 , 10 
10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g., amino acids, sugars, drugs, ions, and 
neurotransmitters) is initiated by adding labeled substrate (e.g. radiolabeled with 3 H fluorescently 
labeled with rhodamine, etc.) to the oocytes. After incubating for 30 minutes, uptake is terminated by 
washing the oocytes three times in Na + -free medium, measuring the incorporated label, and 
comparing with controls. TRICH activity is proportional to the level of internalized labeled substrate. 
15 In particular, test substrates include sulfate for TRICH-13, tricarboxylates for TRICH-21, 

dicarboxylates and Na + for TRICH-25, ornithine for TRICH-27, and monocarboxylates for TRICH- 
28. 

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled ATP- 
[Y- 32 P], separation of the hydrolysis products by chromatographic methods, and quantitation of the 
20 recovered 32 P using a scintillation counter. The reaction mixture contains ATP-[y- 32 P] and varying 
amounts of TRICH in a suitable buffer incubated at 37 °C for a suitable period of time. The reaction 
is terminated by acid precipitation with trichloroacetic acid and then neutralized with base, and an 
aliquot of the reaction mixture is subjected to membrane or filter paper-based chromatography to 
separate the reaction products. The amount of 32 P liberated is counted in a scintillation counter. The 
25 amount of radioactivity recovered is proportional to the ATPase activity of TRICH in the assay. 
XVHI. Identification of TRICH Agonists and Antagonists 

TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or HEK 
(Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in the 
presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using 
30 patch clamp methods well known in the art or as described in Example XVtt Alternatively, ion 
channel activity is assayed using fluorescent techniques that measure ion flux across the cell 
membrane (Velicelebi, G. et al. (1999) Meth. Enzymol. 294:2047; West, M.R. and C.R. Molloy 
• (1996) Anal. Biochem. 241:51-58). These assays may be adapted for high-throughput screening 
using microplates. Changes in internal ion concentration are measured using fluorescent dyes such as 
35 the Ca 2+ indicator Fluo-4 AM, sodium-sensitive dyes such as SBFI and sodium green, or the CI" 
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indicator MQAE (all available from Molecular Probes) in combination with the FLIPR fluorimetric 
plate reading system (Molecular Devices). In a more generic version of this assay, changes in 
membrane potential caused by ionic flux across the plasma membrane are measured using oxonyl 
dyes such as DiBAQ (Molecular Probes). DiBAC 4 equilibrates between the extracellular solution 
and cellular sites according to the cellular membrane potential. The dye's fluorescence intensity is 
20-fold greater when bound to hydrophobic intracellular sites, allowing detection of DiB AQ entry 
into the cell (Gonzalez, J.E. and P.A. Negulescu (1998) Curr. Opin. Biotechnol. 9:624-631). 
Candidate agonists or antagonists may be selected from known ion channel agonists or antagonists, 
peptide libraries, or combinatorial chemical libraries. 



10 



Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the 
invention. Although the invention has been described in connection with certain embodiments, it 
should be understood that the invention as claimed should not be unduly limited to such specific 
15 embodiments. Indeed, various modifications Of the described modes for carrying out the invention 
which are obvious to those skilled in molecular biology or related fields are intended to be within the 
scope of the following claims. 
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What is claimed is: 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting of 
5 SEQIDNO:1-30, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical 
to an amino acid sequence selected from the group consisting of SEQ ID NO:1-30, 

c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-30, and 

10 d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from 

the group consisting of SEQ ID NO: 1-30. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l- 



30. 

15 



3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 



20 

NO:31-60. 



5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 



6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

25 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

30 9. A method of producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of 
claim 1, and 
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b) recovering the polypeptide so expressed. 

10. An isolated antibody which specifically binds to a polypeptide of claim 1. 

1 1. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting 
ofSEQIDNO:31-60, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SBQ ID NO:31-60, 

c) a polynucleotide complementary to a polynucleotide of a), 

d) a polynucleotide complementary to a polynucleotide of b), and 

e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 1 1. 

13. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 11, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 
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16. A composition comprising a polypeptide of claim 1 and a pharmaceutical^ acceptable 
excipient. 

17. A composition of claim 16, wherein the polypeptide has an amino acid sequence selected 
5 from the group consisting of SEQ ID NO: 1-30. 

18. A method for treating a disease or condition associated with decreased expression of 
functional TRICH, comprising afcninistering to a patient in need of such trealment the composition 
of claim 16. 

10 

19. A method of screening a compound for effectiveness as an agonist of a polypeptide of 

claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

15 

20. A composition comprising an agonist compound identified by a method of claim 19 and 
a pharmaceutically acceptable excipient 

21. A method for treating a disease or condition associated with decreased expression of 
20 functional TRICH, comprising administering to a patient in need of such treatment a composition of 

claim 20. 

22. A method of screening a compound for effectiveness as an antagonist of a polypeptide of 

claim 1, the method comprising: 
25 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

23. A composition comprising an antagonist compound identified by a method of claim 22 
and a pharmaceutically acceptable excipient 

30 

24. A method for treating a disease or condition associated with overexpression of functional 
TRICH, comprising administering to a patient in need of such treatment a composition of claim 23. 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 



135 



WO 02/12340 



PCTAJS01/24217 



1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 

conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
5 compound that specifically binds to the polypeptide of claim 1 . 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1 , the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
10 permissive for the activity of the polypeptide of claim 1 , 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 
with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 

15 in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1. 

27. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 

20 comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under conditions 
suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying amounts 
25 of the compound and in the absence of the compound. 

28. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 
30 least 20 contiguous nucleotides of a polynucleotide of claim 11 under conditions whereby a specific 

hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
1 1 or fragment thereof, 

c) quantifying the amount of hybridization complex, and 

35 d) comparing the amount of hybridization complex in the treated biological sample with the 
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amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test 
compound. 

5 29. A diagnostic test for a condition or disease associated with the expression of TRICH in a 

biological sample, the method comprising: 

' a) combining the biological sample with an antibody of claim 10, under conditions suitable 
for the antibody to bind the polypeptide and form an antibody:polypeptide complex, and 

b) detecting the complex, wherein the presence of the complex correlates with the presence 
10 of the polypeptide in the biological sample. 

30. The antibody of claim 10, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 
15 c) a Fab fragment, 

d) a F(ab') 2 fragment, or 

e) a humanized antibody. 



20 



31. A composition comprising an antibody of claim 10 and an acceptable excipient. 

32. A method of diagnosing a condition or disease associated with the expression of TRICH 
in a subject, comprising aciministering to said subject an effective amount of the composition of claim 
31. 

25 33. A composition of claim 31, wherein the antibody is labeled. 

34. A method of diagnosing a condition or disease associated with the expression of TRICH 
in a subject, comprising administering to said subject an effective amount of the composition of claim 
33. 



30 



35. A memod of prepamg a polyclonal antib^ 

10, the method comprising: 

a) inmiunizing an animal with a polypeptide having an amino acid sequence selected from 
the group consisting of SEQ ID NO.1-30, or an immunogenic fragment thereof, under conditions to 
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elicit an antibody response, 

b) isolating antibodies from said animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal 
antibody which binds specifically to a polypeptide having an amino acid sequence selected from the 

5 group consisting of SEQ ID NO: 1-30. 

36. An antibody produced by a method of claim 35. 

37. A composition comprising the antibody of claim 36 and a suitable carrier. 

10 

38. A method of making a monoclonal antibody with the specificity of the antibody of claim 
10, the method comprising: 

a) immunizing an animal with a polypeptide having an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-30, or an immunogenic fragment thereof, under conditions to 

15 elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal antibody- 
producing hybridoma cells, 

d) culturing the hybridoma cells, and 

20 e) isolating from the culture monoclonal antibody which binds specifically to a polypeptide 

having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30. 

39. A monoclonal antibody produced by a method of claim 38. 

25 40. A composition comprising the antibody of claim 39 and a suitable carrier. 

41. The antibody of claim 10, wherein the antibody is produced by screening a Fab 
expression library. . 

. 30 42. The antibody of claim 10, wherein the antibody is produced by screening a recombinant 

immunoglobulin library. 

43. A method of detecting a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-30 in a sample, the method comprising: 
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a) incubating the antibody of claim 10 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-30 in 

5 the sample. 

44. Amemtrfofpurifymgapolypeptidehavrnga^ 
group consisting of SEQ ID NO:1-30 from a sample, the method comprising: 

a) incubating the antibody of claim 10 with a sample under conditions to allow specific 
10 binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ED NO: 1-30. 



15 



20 



25 



30 



45. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:l. 

46. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

47. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ED NO:3. 

48. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ED NO:4. 

49. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

50. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ED NO:6. 

51. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ED NO:7. 

52. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ED NO:8. 

53. A polypeptide of claim 1. comprising the amino acid sequence of SEQ ED NO:9. 

54. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ED NO: 10. 

55. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ED NO: 1 1. 
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56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 12. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 13. 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 14. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15. 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:16. 

61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:17. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:18. 
15 63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:19. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 



20 
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73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 

5 75. Apolynucleotideofclaimll.comprisingthepolynucleotidesequenceofSEQID 

NO:31. 

76. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:32. 

10 

77. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:33. 

78. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

15 NO:34. 

79. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:35. 

20 80. Apolynucleotideofclaimll,comprisingthepolynucleotidesequenceofSEQID 

NO:36. 

81. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:37. 

25 

82. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:38. 

83. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

30 NO:39. 

84. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:40. 
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85. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:41. 

86. A polynucleotide of claim 1 1. comprising the polynucleotide sequence of SEQ ED 

5 NO:42. 

87. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ED 

NO:43. 

10 88. ApolynucleotideofclaimU.comprisingthepolynucleotidesequenceofSEQID 

NO:44. 

89. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ED 

NO:45. 

15 

90. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ED 

NO:46. 

91. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ED 

20 NO:47. 

92. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ED 

NO:48. 

25 93. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ED 

NO:49. 

94. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ED 

NO:50. 

30 

95. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ED 

NO:51. 

96. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ED 
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NO:52. 

97. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:53. 

5 

98. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:54. 

99. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

10 NO:55. 

100. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:56. 

15 101. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:57. 

102. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:58. 

20 

103. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:59. 

104. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

25 NO:60. 
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<120> TRANSPORTERS AND ION CHANNELS 

<130> PI-0183 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/223,269; 60/224,456; 60/226,410; 60/228,140; 60/230,067; 

<151> 2000-08-03; 2000-08-10; 2000-08-18; 2000-08-25; 2000-08-31; 
2000-09-08 

<160> 60 

<170> PERL Program 

<210> 1 

<211> 308 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2194064CD1 

<400> 1 , 
Met Val Gly Gly Val Leu Ala Ser Leu Gly Phe Val Phe Ser Ala 
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15 1° 15 

Phe Ala Ser Asp Leu Leu His Leu Tyr Leu Gly Leu Gly Leu Leu 

20 25 
Ala Gly Phe Gly Trp Ala Leu val Phe Ala Pro Ala Leu Gly Thr 



Leu Ser Arg Tyr Phe Ser Arg Arg Arg Val Leu Ala Val Gly Leu 

Ala Leu Thr Gly Asn Gly Ala Ser Ser Leu Leu Leu Ala Pro Ala 

65 70 
Leu Gin Leu Leu Leu Asp Thr Phe Gly Trp Arg Gly Ala Leu Leu 

80 " 
Leu Leu Gly Ala lie Thr Leu His Leu Thr Pro Cys Gly Ala Leu 
95 100 AUJ 

Leu Leu Pro Leu Val Leu Pro Gly Asp Pro Pro Ala Pro Pro Arg 

110 1" 
Ser Pro Leu Ala Ala Leu Gly Leu Ser Leu Phe Thr Arg Arg Ala 

125 x " 
Phe Ser lie Phe Ala Leu Gly Thr Ala Leu Val Gly Gly Gly Tyr 

140 1^5 XJU 

Phe Val Pro Tyr Val His Leu Ala Pro His Ala Leu Asp Arg Gly 

155 1^0 - L " 

Leu Gly Gly Tyr Gly Ala Ala Leu Val Val Ala Val Ala Ala Met 

170 1/3 
Gly Asp Ala Gly Ala Arg Leu Val Cys Gly Trp Leu Ala Asp Gin 

185 190 
Gly Trp Val Pro Leu Pro Arg Leu Leu Ala Val Phe Gly Ala Leu 

200 205 ^ x 

Thr Gly Leu Gly Leu Trp Val Val Gly Leu Val Pro Val Val Gly 

Gly Glu Glu Ser . TrP Gly Gly Pro Leu Leu Ala Ala Ala Val Ala 

230 235 
Tyr Gly Leu Ser Ala Gly Ser Tyr Ala Pro Leu Val Phe Gly Val 

245 

Leu Pro Gly Leu Val Gly Val Gly Gly Val Val Gin Ala Thr Gly 

260 265 _ „ 

Leu Val Met Met Leu Met Ser Leu Gly Gly Leu Leu Gly Pro Pro 
275 280 ZOJ 

Leu Ser Gly Lys Asp Leu Ser Ser Gin lie Cys Leu Gin Leu Ser 



290 

Ser Ala Pro Gly Val Arg Gly Phe 
305 



<210> 2 

<211> 606 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 2744094CD1 



iTlll Glu Gin Leu Ser Gin Gin Leu Pro Arg Thr Cys Leu Trp 

i 5 10 

His Leu Tyr lie Thr Thr Val Ser Leu Pro Gly Tyr Met Val Ser 
20 25 JU 

Cys lie lie Phe Phe Phe Val Val Pro lie Val Phe Leu Thr lie 
35 40 * 3 

Phe Ser Phe Trp Trp Leu Ser Tyr Trp Leu Glu Gin Gly Ser Gly 

Thr Asn Ser Ser Arg Glu Ser Asn Gly Thr Met Ala Asp Leu Gly 

65 70 
Asn lie Ala Asp Asn Pro Gin Leu Ser Phe Tyr Gin Leu Val Tyr 
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80 85 90 

Gly Leu Asn Ala Leu Leu Leu He Cys Val Gly Val Cys Ser Ser 
95 100 105 

Gly He Phe Thr Lys Val Thr Arg Lys Ala Ser Thr Ala Leu His 
110 115 120 

Asn Lys Leu Phe Asn Lys Val Phe Arg Cys Pro Met Ser Phe Phe 
125 130 135 

Asp Thr He Pro He Gly Arg Leu Leu Asn Cys Phe Ala Gly Asp 
140 145 150 

Leu Glu Gin Leu Asp Gin Leu Leu Pro He Phe Ser Glu Gin Phe 
155 160 165 

Leu Val Leu Ser Leu Met Val He Ala Val Leu Leu He Val Ser 
170 175 180 

Val Leu Ser Pro Tyr He Leu Leu Met Gly Ala He He Met Val 
185 190 195 

lie Cys Phe He Tyr Tyr Met Met Phe Lys Lys Ala He Gly Val 
200 205 210 

Phe Lys Arg Leu Glu Asn Tyr Ser Arg Ser Pro Leu Phe Ser His 
215 220 225 

He Leu Asn Ser Leu Gin Gly Leu Ser Ser He His Val Tyr Gly 
230 235 240 

Lys Thr Glu Asp Phe He Ser Gin Phe Lys Arg Leu Thr Asp Ala 
245 250 255 

Gin Asn Asn Tyr Leu Leu Leu Phe Leu Ser Ser Thr Arg Trp Met 
260 265 270 

Ala Leu Arg Leu Glu He Met Thr Asn Leu Val Thr Leu Ala Val 
275 280 285 

Ala Leu Phe Val Ala Phe Gly He Ser Ser Thr Pro Tyr Ser Phe 
290 295 300 

Lys Val Met Ala Val Asn He Val Leu Gin Leu Ala Ser Ser Phe 
305 310 315 

Gin Ala Thr Ala Arg He Gly Leu Glu Thr Glu Ala Gin Phe Thr 
320 325 330 

Ala Val Glu Arg He Leu Gin Tyr Met Lys Met Cys Val Ser Glu 
335 340 345 

Ala Pro Leu His Met Glu Gly Thr Ser Cys Pro Gin Gly Trp Pro 
350 355 360 

Gin His Gly Glu He He Phe Gin Asp Tyr His Met Lys Tyr Arg 
365 370 375 

Asp Asn Thr Pro Thr Val Leu His Gly He Asn Leu Thr He Arg 
380 385 390 

Glv His Glu Val Val Gly He Val Gly Arg Thr Gly Ser Gly Lys 
395 " 400 405 

Ser Ser Leu Gly Met Ala Leu Phe Arg Leu Val Glu Pro Met Ala 
410 415 420 

Gly Arg He Leu He Asp Gly Val Asp He Cys Ser He Gly Leu 
425 430 435 

Glu Asp Leu Arg Ser Lys Leu Ser Val He Pro Gin Asp Pro Val 
440 445 450 

Leu Leu Ser Gly Thr He Arg Phe Asn Leu Asp Pro Phe Asp Arg 
455 460 465 

His Thr Asp Gin Gin He Trp Asp Ala Leu Glu Arg Thr Phe Leu 
470 475 480 

Thr Lys Ala lie Ser Lys Phe Pro Lys Lys Leu His Thr Asp Val 
485 490 495 

Val Glu Asn Gly Gly Tyr Phe Ser Val Gly Glu Arg Gin Leu Leu 
500 505 510 

Cys He Ala Arg Ala Val Leu Arg Asn Ser Lys He He Leu He 
515 520 525 

Asp Glu Ala Thr Ala Ser He Asp Met Glu Thr Asp Thr Leu He 
530 535 540 

Gin Arg Thr He Arg Glu Ala Phe Gin Gly Cys Thr Val Leu Val 
545 550 555 
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He Ala His Arg Val Thr Thr Val Leu Asn Cys Asp Arg He Leu 



560 



565 570 
Val Met Gly Asn Gly Lys Val Val Glu Phe Asp Arg Pro Glu Val 
575 580 D0:> 

Leu Arg Lys Lys Pro Gly Ser Leu Phe Ala Ala Leu Met Ala Thr 
590 595 600 

Ala Thr Ser Ser Leu Arg 
605 

<210> 3 

<211> 1642 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2798241CD1 

Me2°ser Thr Ala He Arg Glu Val Gly Val Trp Arg Gin Thr Arg 
1 5 1° 15 

Thr Leu Leu Leu Lys Asn Tyr Leu He Lys Cys Arg Thr Lys Lys 

20 25 30 

Ser Ser Val Gin Glu He Leu Phe Pro Leu Phe Phe Leu Phe Trp 

35 40 4i 

Leu He Leu He Ser Met Met His Pro Asn Lys Lys Tyr Glu Glu 

50 55 60 

Val Pro Asn He Glu Leu Asn Pro Met Asp Lys Phe Thr Leu Ser 

65 70 
Asn Leu He Leu Gly Tyr Thr Pro Val Thr Asn He Thr Ser Ser 

80 85 9° 

He Met Gin Lys Val Ser Thr Asp His Leu Pro Asp Val He He 

95 100 ±vo 

Thr Glu Glu Tyr Thr Asn Glu Lys Glu Met Leu Thr Ser Ser Leu 
110 H5 I 20 

Ser Lys Pro Ser Asn Phe Val Gly Val Val Phe Lys Asp Ser Met 

125 130 x 

Ser Tyr Glu Leu Arg Phe Phe Pro Asp Met He Pro Val Ser Ser 

140 I 45 i E ,u 

He Tyr Met Asp Ser Arg Ala Gly Cys Ser Lys Ser Cys Glu Ala 

155 160 J-" 3 

Ala Gin Tyr Trp Ser Ser Gly Phe Thr Val Leu Gin Ala Ser lie 

170 I" 75 ■ L0U 

Asp Ala Ala He He Gin Leu Lys Thr Asn Val Ser Leu Trp Lys 

185 190 I 95 

' Glu Leu Glu Ser Thr Lys Ala Val He Met Gly Glu Thr Ala Val 

200 205 210 

Val Glu He Asp Thr Phe Pro Arg Gly Val He Leu He Tyr Leu 

215 220 225 

Val He Ala Phe Ser Pro Phe Gly Tyr Phe Leu Ala He His lie 

230 235 240 

Val Ala Glu Lys Glu Lys Lys He Lys Glu Phe Leu Lys He Met 

245 250 25b 

Gly Leu His Asp Thr Ala Phe Trp Leu Ser Trp Val Leu Leu Tyr 

260 265 
Thr Ser Leu He Phe Leu Met Ser Leu Leu Met Ala Val He Ala 

275 280 285 

Thr Ala Ser Leu Leu Phe Pro Gin Ser Ser Ser He Val He Phe 

290 295 300 

Leu Leu Phe Phe Leu Tyr Gly Leu Ser Ser Val Phe Phe Ala Leu 

305 31° 315 

Met Leu Thr Pro Leu Phe Lys Lys Ser Lys His Val Gly He Val 

320 32 5 JJU 
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Glu Phe Phe Val Thr Val Ala Phe Gly Phe lie Gly Leu Met lie 

335 340 
He Leu lie Glu Ser Phe Pro Lys Ser Leu Val Trp Leu Phe Ser 

350 355 
Pro Phe Cys His Cys Thr Phe Val He Gly lie Ala Gin Val Met 

365 ^70 
His Leu Glu Asp Phe Asn Glu Gly Ala Ser Phe Ser Asn Leu Thr 

380 385 
Ala Gly Pro Tyr Pro Leu lie lie Thr lie He Met Leu Thr Leu 

395 400 ^ UJ 

Asn Ser He Phe Tyr Val Leu Leu Ala Val Tyr Leu Asp Gin Val 

410 415 
He Pro Gly Glu Phe Gly Leu Arg Arg Ser Ser Leu Tyr Phe Leu 

425 430 
Lys Pro Ser Tyr Trp Ser Lys Ser Lys Arg Asn Tyr Glu Glu Leu 

Ser Glu Gly Asn Val Asn Gly Asn He Ser Phe Ser Glu He He 
455 460 <lDO 

Glu Pro Val Ser Ser Glu Phe Val Gly Lys Glu Ala lie Arg He 



470 



475 480 
Ser Gly He Gin Lys Thr Tyr Arg Lys Lys Gly Glu Asn Val Glu 

485 490 
Ala Leu Arg Asn Leu Ser Phe Asp He Tyr Glu Gly Gin He Thr 

500 505 
Ala Leu Leu Gly His Ser Gly Thr Gly Lys Ser Thr Leu Met Asn 

515 52 ^ ^ 

He Leu Cys Gly Leu Cys Pro Pro Ser Asp Gly Phe Ala Ser He 

530 535 
Tyr Gly His Arg Val Ser Glu He Asp Glu Met Phe Glu Ala Arg 
u 545 550 0DD 

Lys Met He Gly He Cys Pro Gin Leu Asp He His Phe Asp Val 

560 565 
Leu Thr Val Glu Glu Asn Leu Ser He Leu Ala Ser He Lys Gly 

He Pro Ala Asn Asn He He Gin Glu Val Gin Lys Val Leu Leu 

590 595 600 

Asp Leu Asp Met Gin Thr He Lys Asp Asn Gin Ala Lys Lys Leu 

605 610 - 

Ser Gly Gly Gin Lys Arg Lys Leu Ser Leu Gly He Ala Val Leu 

620 DJU 
Gly Asn Pro Lys He Leu Leu Leu Asp Glu Pro Thr Ala Gly Met 

635 ^40 
Asp Pro Cys Ser Arg His He Val Trp Asn Leu Leu Lys Tyr Arg 

Lys Ala Asn Arg Val Thr Val Phe Ser Thr His Phe Met Asp Glu 

665 670 
Ala Asp lie Leu Ala Asp Arg Lys Ala Val He Ser Gin Gly Met 

680 685 
Leu Lys Cys Val Gly Ser Ser Met Phe Leu Lys Ser Lys Trp Gly 

695 700 
He Gly Tyr Arg Leu Ser Met Tyr He Asp Lys Tyr Cys Ala Thr 

710 71 5 ' zu 

Glu Ser Leu Ser Ser Leu Val Lys Gin His He Pro Gly Ala Thr 

725 7 30 
Leu Leu Gin Gin Asn Asp Gin Gin Leu Val Tyr Ser Leu Pro Phe 

740 745 I 

Lys Asp Met Asp Lys Phe Ser Gly Leu Phe Ser Ala Leu Asp Ser 

755 760 765 

His Ser Asn Leu Gly Val He Ser Tyr Gly Val Ser Met Thr Thr 

770 • 775 780 

Leu Glu Asp Val Phe Leu Lys Leu Glu Val Glu Ala Glu He Asp 

785 

Gin Ala Asp Tyr Ser Val Phe Thr Gin Gin Pro Leu Glu Glu Glu 
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800 805 810 

Met Asp Ser Lys Ser Phe Asp Glu Met Glu Gin Ser Leu Leu lie 
815 820 825 

Leu Ser Glu Thr Lys Ala Ser Leu Val Ser Thr Met Ser Leu Trp 
830 835 840 

Lys Gin Gin Met Tyr Thr lie Ala Lys Phe His Phe Phe Thr Leu 
845 850 855 

Lys Arg Glu Ser Lys Ser Val Arg Ser Val Leu Leu Leu Leu Leu 
860 865 870 

He Phe Phe Thr Val Gin He Phe Met Phe Leu Val His His Ser 
875 880 885 

Phe Lys Asn Ala Val Val Pro He Lys Leu Val Pro Asp Leu Tyr 
890 895 900 

Phe Leu Lys Pro Gly Asp Lys Pro His Lys Tyr Lys Thr Ser Leu 
905 910 915 

Leu Leu Gin Asn Ser Ala Asp Ser Asp He Ser Asp Leu He Ser 
920 925 930 

Phe Phe Thr Ser Gin Asn He Met Val Thr Met He Asn Asp Ser 
935 940 945 

Asp Tyr Val Ser Val Ala Pro His Ser Ala Ala Leu Asn Val Val 
950 955 960 

His Ser Glu Lys Asp Tyr Val Phe Ala Ala Val Phe Asn Ser Thr 
965 970 975 

Met Val Tyr Ser Leu Pro He Leu Val Asn He He Ser Asn Tyr 
980 985 990 

Tyr Leu Tyr His Leu Asn Val Thr Glu Thr He Gin He Trp Ser 
* 995 1000 1005 

Thr Pro Phe Phe Gin Glu He Thr Asp He Val Phe Lys He Glu 
1010 1015 I 020 

Leu Tyr Phe Gin Ala Ala Leu Leu Gly He He Val Thr Ala Met 
1025 1030 1035 

Pro Pro Tyr Phe Ala Met Glu Asn Ala Glu Asn His Lys He Lys 
1040 1045 1050 

Ala Tyr Thr Gin Leu Lys Leu Ser Gly Leu Leu Pro Ser Ala Tyr 
1055 1060 1065 

Trp He Gly Gin Ala Val Val Asp He Pro Leu Phe Phe He lie 
1070 1075 1080 

Leu He Leu Met Leu Gly Ser Leu Leu Ala Phe His Tyr Gly Leu 
1085 1090 1095 

Tyr Phe Tyr Thr Val Lys Phe Leu Ala Val Val Phe Cys Leu lie 
1100 H05 IHO 

Gly Tyr Val Pro Ser Val He Leu Phe Thr Tyr He Ala Ser Phe 
1115 1120 H25 

Thr Phe Lys Lys He Leu Asn Thr Lys Glu Phe Trp Ser Phe lie 
1130 H35 H40 

Tyr Ser' Val Ala Ala Leu Ala Cys He Ala He Thr Glu He Thr 
1145 1150 H55 

Phe Phe Met Gly Tyr Thr He Ala Thr He Leu His Tyr Ala Phe 
1160 H65 H'O 

Cys He He He Pro He Tyr Pro Leu Leu Gly Cys Leu He Ser 
1175 1180 H85 

Phe He Lys He Ser Trp Lys Asn Val Arg Lys Asn Val Asp Thr 
1190 1195 1200 

Tyr Asn Pro Trp Asp Arg Leu Ser Val Ala Val He Ser Pro Tyr 
1205 1210 1215 

Leu Gin Cys Val Leu Trp He Phe Leu Leu Gin Tyr Tyr Glu Lys 
1220 1225 1230 

Lys Tyr Gly Gly Arg Ser He Arg Lys Asp Pro Phe Phe Arg Asn 
1235 1240 1245 

Leu Ser Thr Lys Ser Lys Asn Arg Lys Leu Pro Glu Pro Pro Asp 
1250 1255 1260 

Asn Glu Asp Glu Asp Glu Asp Val Lys Ala Glu Arg Leu Lys Val 
1265 . 1270 1275 
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Lys Glu Leu Met Gly Cys Gin Cys Cys Glu Glu Lys Pro Ser lie 
1280 1285 1290 

Met Val Ser Asn Leu His Lys Glu Tyr Asp Asp Lys Lys Asp Phe 
1295 1300 I 3015 

Leu Leu Ser Arg Lys Val Lys Lys Val Ala Thr Lys Tyr He Ser 
1310 1315 I 320 

Phe Cys Val Lys Lys Gly Glu He Leu Gly Leu Leu Gly Pro Asn 
1325 1330 1335 

Gly Ala Gly Lys Ser Thr He He Asn He Leu Val Gly Asp He 
1340 1345 l« u 

Glu Pro Thr Ser Gly Gin Val Phe Leu Gly Asp Tyr Ser Ser Glu 
I355 1360 13°5 

Thr Ser Glu Asp Asp Asp Ser Leu Lys Cys Met Gly Tyr Cys Pro 
1370 1375 1380 

Gin lie Asn Pro Leu Trp Pro Asp Thr Thr Leu Gin Glu His Phe 
1385 1390 I 3 " 

Glu He Tyr Gly Ala Val Lys Gly Met Ser Ala Ser Asp Met Lys 
1400 1405- J-"" 

Glu Val lie Ser Arg He Thr His Ala Leu Asp Leu Lys Glu His 

1415 1420 
Leu Gin Lys Thr Val Lys Lys Leu Pro Ala Gly He Lys Arg Lys 
1430 1435 iftftu 

Leu Cys Phe Ala Leu Ser Met Leu Gly Asn Pro Gin He Thr Leu 
1445 1450 imd 

Leu Asp Glu Pro Ser Thr Gly Met Asp Pro Lys Ala Lys Gin His 
1460 1465 J-*' u 

Met Trp Arg Ala He Arg Thr Ala ^Lys Asn Arg Lys Arg_Ala 

Ala He Leu Thr^Thr His Tyr Met Glu Glu Ala Glu Ala Val Cys 
1490 1495 I 500 

Asp Arg Val Ala He Met Val Ser Gly Gin Leu Arg Cys He Gly 
1505 1510 J.3J.O 

Thr Val Gin His Leu Lys Ser Lys Phe Gly Lys Gly Tyr Phe Leu 

1520 1525 I 530 

Glu He Lys Leu Lys Asp Trp He Glu Asn Leu Glu Val Asp Arg 

1535 1540 
Leu Gin Arg Glu lie Gin Tyr He Phe Pro Asn Ala Ser Arg Gin 

1550 I 555 ^ 1D 

Glu Ser Phe Ser Ser He Leu Ala Tyr Lys He Pro Lys Glu Asp 

1565 1570 1575 

Val Gin Ser Leu Ser Gin Ser Phe Phe Lys Leu Glu Glu Ala Lys 

1580 1585 1590 

His Ala Phe Ala lie Glu Glu Tyr Ser Phe Ser Gin Ala Thr Leu 

1595 1600 I 605 

Glu Gin Val Phe Val Glu Leu Thr Lys Glu Gin Glu Glu Glu Asp 

1610 1615 1 6 20 

Asn Ser Cys Gly Thr Leu Asn Ser Thr Leu Trp Trp Glu Arg Thr 

1625 1^30 I 635 

Gin Glu Asp -Arg Val Val Phe 

1640 

<210> 4 

<211> 659 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3105257CD1 

Met°Gly Arg Gly Ala Gly Ala Ala Leu Gly Arg Trp Ser Arg Ala 
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Pro Leu Glu Glu Leu Leu Pro Gly Arg Gly Ser Gly Arg Leu Gly 



Gly Pro Arg Gly Pro Arg Thr Ala Pro Gly Ala Val Gly Leu Gly 

Pro Ala Ala Ala Gly Glu Glu Ala Trp Arg Arg Gly Arg Ala Ala 
50 

Pro Ser Arg Asp Asp Gin Arg Leu Arg Pro Met Ala Pro Gly Leu 

65 ' u -I 

Ser Glu Ala Gly Lys Leu Leu Gly Leu Glu Tyr Pro Glu Arg Gin 

80 

Arg Leu Ala Ala Ala Val Gly Phe Leu Thr Met Ser Gly Val lie 

Ser Met Ser Ala Pro Phe Phe Leu Gly Lys lie lie Asp Ala lie 

110 115 
Tyr Thr Asn Pro Thr Val Asp Tyr Ser Asp Asn Leu Thr Arg Leu 

125 130 
Cys Leu Gly Leu Ser Ala Val Phe Leu Cys Gly Ala Ala Ala Asn 



Ala lie Arg Val Tyr Leu Met Gin Thr Ser Gly Gin Arg He Val 



145 



155 

Asn Arg Leu Arg Thr Ser Leu Phe Ser Ser lie Leu Arg Gin Glu 

170 175 
Val Ala Phe Phe Asp Lys Thr Arg Thr Gly Glu Leu He Asn Arg 

185 190 ^ 

Leu Ser Ser Asp Thr Ala Leu Leu Gly Arg Ser Val Thr Glu Asn 



160 



Leu Ser Asp Gly Leu Arg Ala Gly Ala Gin Ala Ser Val Gly lie 



205 



2^5 

Ser Met Met Phe Phe Val Ser Pro Asn Leu Ala Thr Phe Val Leu 



220 



Ser Val Val Pro III Val Ser lie He Ala Val He Tyr Gly Arg 

245 ol)U 
Tyr Leu Arg Lys Leu Thr Lys Val Thr Gin Asp Ser Leu Ala Gin 

260 «oD 
Ala Thr Gin Leu Ala Glu Glu Arg He Gly Asn Val Arg Thr Val 

275 2oU 
Arg Ala Phe Gly Lys Glu Met Thr Glu lie Glu Lys Tyr Ala Ser 



235 



290 — - 

Lys Val Asp His Val Met Gin Leu Ala Arg Lys Glu Ala Phe Ala 



305 v _-. 

Arg Ala Gly Phe Phe Gly Ala Thr Gly Leu Ser Gly Asn Leu He 



310 



320 " " ~ 

Val Leu Ser Val Leu Tyr Lys Gly Gly Leu Leu Met Gly Ser Ala 

Phe 

350 3 55 3 f° 

Trp Val Gly He Ser He Gly Gly Leu Ser Ser Phe Tyr Ser Glu 

365 370 
Leu Met Lys Gly Leu Gly Ala Gly Gly Arg Leu Trp Glu Leu Leu 
380 

Glu Arg Glu Pro Lys Leu Pro Phe Asn Glu Gly Val He Leu Asn 

395 40U 
Glu Lys Ser Phe Gin Gly Ala Leu Glu Phe Lys Asn Val His Phe 

410 415 
Ala Tyr Pro Ala Arg Pro Glu Val Pro lie Phe Gin Asp Phe Ser 

425 430 
Leu Ser He Pro Ser Gly Ser Val Thr Ala Leu Val Gly Pro Ser 

440 445 * 

Gly Ser Gly Lys Ser Thr Val Leu Ser Leu Leu Leu Arg Leu Tyr 

455 460 
Asp Pro Ala Ser Gly Thr He Ser Leu Asp Gly His Asp He Arg 



325 



335 " 340 



His Met Thr Val Gly Glu Leu Ser Ser Phe Leu Met Tyr Ala Phe 



470 



475 



Gin Leu Asn Pro Val Trp Leu Arg Ser Lys lie Gly Thr Val Ser 
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485 490 495 

Gin Glu Pro lie Leu Phe Ser Cys Ser lie Ala Glu Asn lie Ala 
500 505 jxu 

Tyr Gly Ala Asp Asp Pro Ser Ser Val Thr Ala Glu Glu He Gin 
515 520 3 ^- > 

Arg Val Ala Glu Val Ala Asn Thr Val Ala Phe lie Arg Asn Phe 

530 535 
Pro Gin Gly Phe Asn Thr Val Val Gly Glu Lys Gly Val Leu Leu 
545 550 DDD 

Ser Gly Gly Gin Lys Gin Arg He Ala lie Ala Arg Ala Leu Leu 

560 565 
Lys Asn Pro Lys lie Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu 
575 580 DOZ} 

Asp Ala Glu Asn Glu Tyr Leu Val Gin Glu Ala Leu Asp Arg Leu 
590 595 DUU 

Met Asp Gly Arg Thr Val Leu Val He Ala His Arg Leu Ser Thr 
605 °10 ° ±Z} 

lie Lys Asn Ala Asn Met Val Ala Val Leu Asp Gin Gly Lys lie 

620 625 " 

Thr Glu Tyr Gly Lys His Glu Glu Leu Leu Ser Lys Pro Asn Gly 

635 640 
He Tyr Arg Lys Leu Met Asn Lys Gin Ser Phe He Ser Ala 
650 655 

<210> 5 
<211> 1592 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3200979CD1 

Serial Lys Lys Glu He Ser Val Arg Gin Gin He Gin Ala Leu 

1 5 10 

Leu Tyr Lys Asn Phe Leu Lys Lys Trp Arg lie Lys Arg Glu Phe 
20 25 JU 

He Gly Leu Tyr Leu Cys He Phe Ser Glu His Phe Arg Ala Thr 

35 40 
Arg Phe Pro Glu Gin Pro Pro Lys Val Leu Gly Ser Val Asp Gin 

50 55 
Phe Asn Asp Ser Gly Leu Val Val Ala Tyr Thr Pro Val Ser Asn 

65 70 
He Thr Gin Arg lie Met Asn Lys Met Ala Leu Ala Ser Phe Met 

80 85 m 

Lys Gly Arg Thr Val He Gly Thr Pro Asp Glu Glu Thr Met Asp 
y J 95 100 105 

He Glu Leu Pro Lys Lys Tyr His Glu Met Val Gly Val He Phe 

110 115 Z 

Ser Asp Thr Phe Ser Tyr Arg Leu Lys Phe Asn Trp Gly Tyr Arg 
125 l^O XJD 

He Pro Val He Lys Glu His Ser Glu Tyr Thr Glu His Cys Trp 

140 145 
Ala Met His Gly Glu He Phe Cys Tyr Leu Ala Lys Tyr Trp Leu 
155 160 X0D 

Lys Gly Phe Val Ala Phe Gin Ala Ala He Asn Ala Ala He He 
170 175 V 

Glu Val Thr Thr Asn His Ser Val Met Glu Glu Leu Thr Ser Val 
185 190 ±yD 

He Gly He Asn Met Lys He Pro Pro Phe He Ser Lys Gly Glu 

200 205 
He Met Asn Glu Trp Phe His Phe Thr Cys Leu Val Ser Phe Ser 
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215 220 225 

Ser Phe lie Tyr Phe Ala Ser Leu Asn Val Ala Arg Glu Arg Gly 

230 235 
Lys Phe Lys Lys Leu Met Thr Val Met Gly Leu Arg Glu Ser Ala 

245 250 ZOD 

Phe Trp Leu Ser Trp Gly Leu Thr Tyr lie Cys Phe lie Phe lie 

260 265 
Met Ser He Phe Met Ala Leu Val He Thr Ser He Pro He Val 

275 280 
Phe His Thr Gly Phe Met Val He Phe Thr Leu Tyr Ser Leu Tyr 

290 295 
Gly Leu Ser Leu He Ala Leu Ala Phe Leu Met Ser Val Leu lie 

305 310 
Arg Lys Pro Met Leu Ala Gly Leu Ala Gly Phe Leu Phe Thr Val 

Phe Trp Gly Cys Leu Gly Phe Thr Val Leu Tyr Arg Gin Leu Pro 

Leu Ser Leu Gly Trp* Val Leu Ser Leu Leu Ser Pro Phe Ala Phe 

350 355 360 

Thr Ala Gly Met Ala Gin He Thr His Leu Asp Asn Tyr Leu Ser 

365 370 
Gly Val He Phe Pro Asp Pro Ser Gly Asp Ser Tyr Lys Met He 

380 38 ^ 
Ala Thr Phe Phe He Leu Ala Phe Asp Thr Leu Phe Tyr Leu He 

395 400 *Y 

Phe Thr Leu Tyr Phe Glu Arg Val Leu Pro Asp Lys Asp Gly His 

410 415 
Gly Asp Ser Pro Leu Phe Phe Leu Lys Ser Ser Phe Trp Ser Lys 

425 430 
His Gin Asn Thr His His Glu He Phe Glu Asn Glu He Asn Pro 

440 445 
Glu His Ser Ser Asp Asp Ser Phe Glu Pro Val Ser Pro Glu Phe 

455 460 
His Gly Lys Glu Ala He Arg He Arg Asn Val He Lys Glu Tyr 

Asn Gly Lys Thr Gly Lys Val Glu Ala Leu Gin Gly He Phe Phe 

485 490 
Asp He Tyr Glu Gly Gin He Thr Ala He Leu Gly His Asn Gly 

500 505 DXV 

Ala Gly Lys Ser Thr Leu Leu Asn He Leu Ser Gly Leu Ser Val 

Ser Thr Glu Gly Ser Ala Thr He Tyr Asn Thr Gin Leu Ser Glu 

He Thr Asp Met Glu Glu He Arg Lys Asn He Gly Phe Cys Pro 

545 550 0=>:> 

Gin Phe Asn Phe Gin Phe Asp Phe Leu Thr Val Arg Glu Asn Leu 

560 565 
Arg val Phe Ala Lys He Lys Gly He Gin Pro Lys Glu Val Glu 

575 580 303 

Gin Glu Val Leu Leu Leu Asp Glu Pro Thr Ala Gly Leu Asp Pro 

590 595 
Phe Ser Arg His Arg Val Trp Ser Leu Leu Lys Glu His Lys Val 

605 610 6 

Asp Arg Leu He Leu Phe Ser Thr Gin Phe Met Asp Glu Ala Asp 

620 625 
He Leu Ala Asp Arg Lys Val Phe Leu Ser Asn Gly Lys Leu Lys 

635 640 
Cys Ala Gly Ser Ser Leu Phe Leu Lys Arg Lys Trp Gly He Gly 

650 655 
Tyr His Leu Ser Leu His Arg Asn Glu Met Cys Asp Thr Glu Lys 

He Thr Ser Leu He Lys Gin His He Pro Asp Ala Lys Leu Thr 
680 685 690 
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Thr Glu Ser Glu Glu Lys Leu Val Tyr Ser Leu Pro Leu Glu Lys 
695 700 705 

Thr Asn Lys Phe Pro Asp Leu Tyr Ser Asp Leu Asp Lys Cys Ser 

710 715 
Asp Gin Gly He Arg Asn Tyr Ala Val Ser Val Thr Ser Leu Asn 

725 730 
Glu Val Phe Leu Asn Leu Glu Gly Lys Ser Ala He Asp Glu Pro 
740 7 * 5 750 

Asp Phe Asp He Gly Lys Gin Glu Lys He His Val Thr Arg Asn 
755 760 765 

Thr Gly Asp Glu Ser Glu Met Glu Gin Val Leu Cys Ser Leu Pro 
770 77 5 780 

Glu Thr Arg Lys Ala Val Ser Ser Ala Ala Leu Trp Arg Arg Gin 
785 7 90 795 

He Tyr Ala Val Ala Thr Leu Arg Phe Leu Lys Leu Arg Arg Glu 
800 805 »1° 

Arg Arg Ala Leu Leu Cys Leu Leu Leu Val Leu Gly He Ala Phe 
815 820 825 

He Pro He He Leu Glu Lys He Met Tyr Lys Val Thr Arg Glu 
830 835 840 

Thr His Cys Trp Glu Phe Ser Pro Ser Met Tyr Phe Leu Ser Leu 

850 855 



845 



Glu Gin He Pro Lys- Thr Pro Leu Thr Ser Leu Leu He Val Asn 
860 865 870 

Asn Thr Gly Ser Asn He Glu Asp Leu Val His Ser Leu Lys Cys 
875 880 885 

Gin Asp He Val Leu Glu He Asp Asp Phe Arg Asn Arg Asn Gly 
890 895 900 

Ser Asp Asp Pro Ser Tyr Asn Gly Ala He He Val Ser Gly Asp 

905 910 
Gin Lys Asp Tyr Arg Phe Ser Val Ala Cys Asn Thr Lys Lys Ser 
920 925 930 

Asn Cys Phe Pro Val Leu Met Gly He Val Ser Asn Ala Leu lie 
935 940 945 

Gly He Phe Asn Phe Thr Glu Leu He Gin Met Glu Ser Thr Ser 
950 955 960 

Phe Phe Arg Asp Asp He Val Leu Asp Leu Gly Phe He Asp Gly 
965 970 975 

Ser He Phe Leu Leu Leu He Thr Asn Cys He Ser Pro Tyr lie 
980 985 990 

Gly He Ser Ser He Ser Asp Tyr Lys He Pro Ser Ser He Pro 
* 995 1000 1005 

Ser He Leu Cys Gin Lys Asn Val Gin Ser Gin Leu Trp He Ser 
1010 1015 1020 

Gly Leu Trp Pro Ser Ala Tyr Trp Cys Gly Gin Ala Leu Val Asp . 

1025 1030 1035 

He Pro Leu His Phe Leu He Leu Leu Ser He His Leu He Tyr 
1040 1045 1050 

Tyr Phe Ser Phe Leu Gly Phe Gin Leu Pro Trp Glu Leu Met Phe 

1055 1° 60 10 

Val Leu Val Val Cys He He Gly Cys Ala Ala Ser Leu He Phe 
1070 10*75 1080 

Leu Met Tyr Val Leu Ser Phe He Phe Cys Lys Trp Arg Lys Asn 
10 85 1090 1095 

Asn Gly Phe Trp Ser Phe Gly Phe Phe He Val Leu He Cys Val 
1100 H05 Hl° 

Ser Thr He Leu Val Ser Thr Lys Tyr Glu Lys Pro Asn Leu lie 
H15 1120 H25 

Leu Cys Met He Phe He Pro Ser P^Thr phe Leu ^ 

Leu Leu He Gin Leu Asn Phe Met Tyr Met Arg Asn Leu Asp Ser 
H45 1150 H55 

' Leu Asp Asn Arg He Asn Glu Val Asn Lys Thr He Leu Leu Thr 
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1160 H65 • H70 

Asn Leu He Pro Tyr Leu Gin Ser Val He Phe Leu Phe Val lie 
H75 1180 H85 

Arg Cys Leu Glu Met Lys Tyr Gly Asn Glu He Met Asn Lys Asp 
1190 1195 1200 

Pro Val Phe Arg He Ser Pro Arg Ser Arg Gly Thr His Thr Asn 
1205 1210 .1215 

Pro Glu Glu Pro Glu Glu Asp Val Gin Ala Glu Arg Val Gin Ala 
1220 1225 1230 

Ala Asn Ala Leu Thr Thr Pro Asn Leu Glu Glu Glu Pro Val Ue 
1235 1240 1245 

Thr Ala Ser Cys Leu His Lys Glu Tyr Tyr Glu Thr Lys Lys Ser 
1250 1255 1260 

Cys Phe Ser Thr Thr Lys Lys Lys Ala Ala He Arg Asn Val Ser 
1265 1270 1275 

Phe Cys Val Lys Lys Gly Glu Val Leu Gly Leu Leu Gly His Asn 
1280 1285 1290 

Gly Ala Gly Lys Ser Thr Ser He Lys Met He Thr Gly Cys Thr 
1295 1300 1305 

Val Pro Thr Ala Gly Val Val Val Leu Gin Gly Asn Arg Ala Ser 
1310 1315 I 320 

Val Arg Gin Gin Arg Asp Asn Ser Leu Lys Phe Leu Gly Tyr Cys 
1325 1330 1335 

Pro Gin Glu Asn Ser Leu Trp Pro Lys Leu Thr Met Lys Glu His 
1340 1345- 1350 

Leu Glu Leu Tyr Ala Ala Val Lys Gly Leu Gly Lys Glu Asp Ala 
1355 1360 1365 

Ala Leu Ser He Ser Arg Leu Val Glu Ala Leu Lys Leu Gin Glu 
1370 1375 1380 

Gin Leu Lys Ala Pro Val Lys Thr Leu Ser Glu Gly He Lys Arg 
1385 1390 1395 

Lys Leu Cys Phe Val Leu Ser He Leu Gly Asn Pro Ser Val Val 
1400 1405 1410 

Leu Leu Asp Glu Pro Phe Thr Gly Met Asp Pro Glu Gly Gin Gin 
1415 1420 1425 

Gin Met Trp Gin He Leu Gin Ala Thr He Lys Asn Gin Glu Arg 
1430 1435 1440 

Gly Thr Leu Leu Thr Thr His Tyr Met Ser Glu Ala Lys Ser Leu 
1445 1450 1455 

Cys Asp Arg Val Ala He Met Val Ser Gly Thr Leu Arg Cys He 
1460 1465 14 ' u 

Gly Ser He Gin His Leu Lys Asn Lys Phe Gly Lys Asp Tyr Leu. 

* 1475 1480 1485 

Leu Glu He Lys Met Lys Glu Pro Thr Gin Val Glu Ala Leu His 
1490 1495 1500 

Thr Glu He Leu Lys Leu Phe Pro Gin Ala Ala Trp Gin Glu Arg 
1505 1510 1515 

Tyr Ser Ser Leu Met Ala Tyr Lys Leu Pro Val Glu Asp Val His 
1520 1525 1530 

Pro Leu Ser Arg Ala Phe Phe Lys Leu Glu Ala Met Lys Gin Thr 
1535 1540 1545 

Phe Asn Leu Glu Glu Tyr Ser Leu Ser Gin Ala Thr Leu Glu Gin 
1550 1555 1560 

Val Phe Leu Glu Leu Cys Lys Glu Gin Glu Leu Gly Asn Val Asp 
1565 1570 1575 

Asp Lys He Asp Thr Thr Val Glu Trp Lys Leu Leu Pro Gin Glu 
1580 1585 1590 

Asp Pro 



<210> 6 
<211> 382 
<212> PRT 



12/85 



WO 02/12340 



PCT/US01/24217 



<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 6754139CD1 

<400> 6 , ^ 

Met Asp Glu Arg Asn Gin Val Leu Thr Leu Tyr Leu Trp He Arg 

15 10 15 

Gin Glu Trp Thr Asp Ala Tyr Leu Arg Trp Asp Pro Asn Ala Tyr 

20 25 30 

Gly Gly Leu Asp Ala He Arg He Pro Ser Ser Leu Val Trp Arg 

35 40 45 

Pro Asp He Val Leu Tyr Asn Lys Ala Asp Ala Gin Pro Pro Gly 

50 55 60 

Ser Ala Ser Thr Asn Val Val Leu Arg His Asp Gly Ala Val Arg 

65 70 75 

Trp Asp Ala Pro Ala He Thr Arg Ser Ser Cys Arg Val Asp Val 

80 85 90 

Ala Ala Phe Pro Phe Asp Ala Gin His Cys Gly Leu Thr Phe Gly 

95 100 105 

Ser Trp Thr His Gly Gly His Gin Val Asp Val Arg Pro Arg Gly 
110 H5 120 

Ala Ala Ala Ser Leu Ala Asp Phe Val Glu Asn Val Glu Trp Arg 

125 130 135 

Val Leu Gly Met Pro Ala Arg Arg Arg Val Leu Thr Tyr Gly Cys 

140 145 150 

Cvs Ser Glu Pro Tyr Pro Asp Val Thr Phe Thr Leu Leu Leu Arg 

155 160 165 

Arc Arg Ala Ala Ala Tyr Val Cys Asn Leu Leu Leu Pro Cys Val 

170 175 180 

Leu He Ser Leu Leu Ala Pro Leu Ala Phe His Leu Pro Ala Asp 

185 190 195 

Ser Gly Glu Lys Val Ser Leu Gly Val Thr Val Leu Leu Ala Leu 

200 205 210 

Thr Val Phe Gin Leu Leu Leu Ala Glu Ser Met Pro Pro Ala Glu 

215 220 225 

Ser Val Pro Leu He Gly Lys Tyr Tyr Met Ala Thr Met Thr Met 

230 235 240 

Val Thr Phe Ser Thr Ala Leu Thr He Leu He Met Asn Leu His 

245 250 255 

Tyr Cys Gly Pro Ser Val Arg Pro Val Pro Ala Trp Ala Arg Ala 

260 265 270 

Leu Leu Leu Gly His Leu Ala Arg Gly Leu Cys Val Arg Glu Arg 

275 280 285 

Glv Glu Pro Cys Gly Gin Ser Arg Pro Pro Glu Leu Ser Pro Ser 

290 295 300 

Pro Gin Ser Pro Glu Gly Gly Ala Gly Pro Pro Ala Gly Pro Cys 

305 310 315 

His Glu Pro Arg Cys Leu Cys Arg Gin Glu Ala Leu Leu His His 

320 325 330 

Val Ala Thr He Ala Asn Thr Phe Arg Ser His Arg Ala Ala Gin 

335 340 . 345 

Arg Cys His Glu Asp Trp Lys Arg Leu Ala Arg Val Met Asp Arg 

350 355 360 

Phe Phe Leu Ala He Phe Phe Ser Met Ala Leu Val Met Ser Leu 

365 370 375 

Leu Val Leu Val Gin Ala Leu 

380 

<210> 7 
<211> 1115 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6996659CD1 

Me?°Lg Arg Leu Ser Leu Trp Trp Leu Leu Ser Arg Val Cys Leu 
1 5 10 



Leu Leu Pro Pro Pro Cys Ala Leu Val Leu Ala Gly Val Pro Ser 

Leu Lys Arg He G 

. Gin Pro Trp Thr T 
50 55 60 



2 0 

Ser Ser Ser His Pro Gin Pro Cys Gin lie Leu Lys Arg lie Gly 
35 40 " 

His Ala Val Arg Val Gly Ala Val His Leu Gin Pro Trp Thr Thr 



Ala Pro Arg Ala Ala Ser Arg Ala Pro Asp Asp Ser Arg Ala Gly 

65 70 
Ala Gin Arg Asp Glu Pro Qlu Pro Gly Thr Arg Arg Ser Pro Ala 

80 " 
Pro Ser Pro Gly Ala Arg Trp Leu Gly Ser Thr Leu His Gly Arg 

95 100 ^ 

Gly Pro Pro Gly Ser Arg Lys Pro Gly Glu Gly Ala Arg Ala Glu 

110 115 
Ala Leu Trp Pro Arg Asp Ala Leu Leu Phe Ala Val Asp Asn Leu 
125 130 i 

Asn Arg Val Glu Gly Leu Leu Pro Tyr Asn Leu Ser Leu Glu Val 

140 145 
Val Met Ala lie Glu Ala Gly Leu Gly Asp Leu Pro Leu Leu Pro 

155 160 " 

Phe Ser Ser Pro Ser Ser Pro Trp Ser Ser Asp Pro Phe Ser Phe 

170 -T 
Leu Gin Ser Val Cys His Thr Val Val Val Gin Gly Val Ser Ala 

185 190 
Leu Leu Ala Phe Pro Gin Ser Gin Gly Glu Met Met Glu Leu Asp 

200 205 
Leu Val Ser Leu Val Leu His lie Pro Val lie Ser lie Val Arg 
215 

His Glu Phe Pro Arg Glu Ser Gin Asn Pro Leu His Leu Gin Leu 



Ser Leu Glu Asn III Leu Ser Ser Asp Ala Asp Val Thr Val Ser 

245 2 " * J 

lie Leu Thr Met Asn Asn Trp Tyr Asn Phe Ser Leu Leu Leu Cys 

260 265 
Gin Glu Asp Trp Asn He Thr Asp Phe Leu Leu Leu Thr Gin Asn 

275 280 * 

Asn Ser Lys Phe His Leu Gly Ser He lie Asn He Thr Ala Asn 
290 

Leu Pro Ser Thr Gin Asp Leu Leu Ser Phe Leu Gin He Gin Leu 

305 310 
Glu Ser He Lys Asn Ser Thr Pro Thr Val Val Met Phe Gly Cys 

320 325 Z. 

Asp Met Glu Ser He Arg Arg He Phe Glu He Thr Thr Gin Phe 

335 340 
Gly Val Met Pro Pro Glu Leu Arg Trp Val Leu Gly Asp Ser Gin 

350 355 
Asn Val Glu Glu Leu Arg Thr Glu Gly Leu Pro Leu Gly Leu lie 

365 3 ™ 
Ala His Gly Lys Thr Thr Gin Ser Val Phe Glu His Tyr Val Gin 

380 385 
Asp Ala Met Glu Leu Val Ala Arg Ala Val Ala Thr Ala Thr Met 

395 400 
He Gin Pro Glu Leu Ala Leu He Pro Ser Thr Met Asn Cys Met 

410 415 
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Glu val Glu Thr Thr Asn Leu Thr Ser Gly Gin Tyr Leu Ser Arg 

425 430 
Phe Leu Ala Asn Thr Thr Phe Arg Gly Leu Ser Gly Ser He Arg 

440 445 
Val Lys Gly Ser Thr He Val Ser Ser Glu Asn Asn Phe Phe lie 

455 4(50 
Trp Asn Leu Gin His Asp Pro Met Gly Lys Pro Met Trp Thr Arg 

• 470 475 
Leu Gly Ser Trp Gin Gly Gly Lys He Val Met Asp Tyr Gly lie 

485 490 . * 

Trp Pro Glu Gin Ala Gin Arg His Lys Thr His Phe Gin His Pro 

500 505 51 

Ser Lys Leu His Leu> Arg Val Val Thr Leu lie Glu His, Pro Phe 

515 520 525 

Val Phe Thr Arg Glu Val Asp Asp Glu Gly Leu Cys Pro Ala Gly 

530 535 , 

Gin Leu Cys Leu Asp Pro Met Thr Asn Asp Ser Ser Thr Leu Asp 

545 0DO 
Ser Leu Phe Ser Ser Leu His Ser Ser Asn Asp Thr Val Pro lie 

560 565 
Lys Phe Lys Lys Cys Cys Tyr Gly Tyr Cys He Asp Leu Leu Glu 

Lys He Ala Glu Asp Met Asn Phe Asp Phe Asp Leu Tyr He Val 

590 595 
Gly Asp Gly Lys Tyr Gly Ala Trp Lys Asn Gly His Trp Thr Gly 

605 610 *>±o 

Leu Val Gly Asp Leu Leu Arg Gly Thr Ala His Met Ala Val Thr 

620 625 
Ser Phe Ser He Asn Thr Ala Arg Ser Gin Val He Asp Phe Thr 

635 

Ser Pro Phe Phe Ser Thr Ser Leu Gly He Leu Val Arg Thr Arg 

650 655 
Asp Thr Ala Ala Pro He Gly Ala Phe Met Trp Pro Leu His Trp 

665 670 
Thr Met Trp Leu Gly He Phe Val Ala Leu His He Thr Ala Val 

Phe Leu Thr Leu Tyr Glu Trp Lys Ser Pro Phe Gly Leu Thr Ser 

695 700 
Lys Gly Arg Asn Arg Ser Lys Val Phe Ser Phe Ser Ser Ala Leu 

710 715 , 720 

Asn He Cys Tyr Ala Leu Leu Phe Gly Arg Thr Val Ala He Lys 

725 730 / - SD 

Pro Pro Lys Cys Trp Thr Gly Arg Phe Leu Met Asn Leu Trp Ala 

740 745 , 

He Phe Cys Met Phe Cys Leu Ser Thr Tyr Thr Ala Asn Leu Ala 

755 760 > bi> 

Ala Val Met Val Gly Glu Lys He Tyr Glu Glu Leu Ser Gly He 

- . 770 775 
His Asp Pro Lys Leu His His Pro Ser Gin Gly Phe Arg Phe Gly 

Thr Val Arg Glu Ser Ser Ala Glu Asp Tyr Val Arg Gin Ser Phe 

800 805 8 r 

Pro Glu Met His Glu Tyr Met Arg Arg Tyr Asn Val Pro Ala Thr 

815 820 825 

Pro Asp Gly Val Glu Tyr Leu Lys Asn Asp Pro Glu Lys Leu Asp 

830 835 ? 

Ala Phe He Met Asp Lys Ala Leu Leu Asp Tyr Glu Val Ser He 

845 650 855 

Asp Ala Asp Cys Lys Leu Leu Thr Val Gly Lys Pro Phe Ala He 

S60 865 mu 870 

Glu Gly Tyr Gly He Gly Leu Pro Pro Asn Ser Pro Leu Thr Ala 

875 880 
Asn He Ser Glu Leu He Ser Gin Tyr Lys Ser His Gly Phe Met 
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890 895 900 

Asp Met Leu His Asp Lys Trp Tyr Arg Val Val Pro Cys Gly Lys 

905 910 915 

Arg Ser Phe Ala Val Thr Glu Thr Leu Gin Met Gly He Lys His 

920 925 930 

Phe Ser Gly Leu Phe Val Leu Leu Cys lie Gly Phe Gly Leu Ser 

He Leu Thr Thr lie Gly Glu His He Val Tyr Arg Leu Leu Leu 

950 955 960 

Pro Arg He Lys Asn Lys Ser Lys Leu Gin Tyr Trp Leu His Thr 
965 970 975 

Ser Gin Arg Leu His Arg Ala He Asn Thr Ser Phe He Glu Glu 
980 985 990 

Lys Gin Gin His Phe Lys Thr Lys Arg Val Glu Lys Arg Ser Asn 
995 1000 1005 

Val Gly Pro Arg Gin Leu Thr Val Trp Asn Thr Ser Asn Leu Ser 

10 10 1015 1° 20 

His Asp Asn Arg Arg Lys Tyr He Phe Ser Asp Glu Glu Gly Gin 

1025 1030 1035 

Asn Gin Leu Gly He Arg He His Gin Asp He Pro Leu Pro Pro 

1040 1045 1050 

Arg Arg Arg Glu Leu Pro Ala Leu Arg Thr Thr Asn Gly Lys Ala 

1055 1060 1065 

Asp Ser Leu Asn Val Ser Arg Asn Ser Val Met Gin Glu Leu Ser 

1070 1075 1080 

Glu Leu Glu Lys Gin He Gin Val He Arg Gin Glu Leu Gin Leu 

1085 1090 1095 

Ala Val Ser Arg Lys Thr Glu Leu Glu Glu Tyr Gin Arg Thr Ser 

1100 HO 5 1110 

Arg Thr Cys Glu Ser 

1115 

<210> 8 
<211> 295 
<212> PRT 
' <213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7472747CD1 

Met°Pro Ser Ala Gly Leu Cys Ser Cys Trp Gly Gly Arg Val Leu 
1 5 1° 15 

Pro Leu Leu Leu Ala Tyr Val Cys Tyr Leu Leu Leu Gly Ala Thr 

20 25 30 

He Phe Gin Leu Leu Glu Arg Gin Ala Glu Ala Gin Ser Arg Asp 

35 40 45 

Gin Phe Gin Leu Glu Lys Leu Arg Phe Leu Glu Asn Tyr Thr Cys 

50 55 60 

Leu Asp Gin Trp Ala Met Glu Gin Phe Val Gin Val He Met Glu 

65 70 75 

Ala Trp Val Lys Gly Val Asn Pro Lys Gly Asn Ser Thr Asn Pro 

80 85 90 

Ser Asn Trp Asp Phe Gly Ser Ser Phe Phe Phe Ala Gly Thr Val 

95 100 1°5 

Val Thr Thr He Gly Tyr Gly Asn Leu Ala Pro Ser Thr Glu Ala 
110 H5 120 

Gly Gin Val Phe Cys Val Phe Tyr Ala Leu Leu Gly He Pro Leu 
125 130 135 

Asn Val He Phe Leu Asn His Leu Gly Thr Gly Leu Arg Ala His 
140 145 150 

Leu Ala Ala He Glu Arg Trp Glu Asp Arg Pro Arg Arg Ser Gin 
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155 160 I 65 

Glu Val Leu Gin Val Leu Gly Leu Ala Leu Phe Leu Thr Leu Gly 

170 I 75 . 180 

Thr Leu Val lie Leu He Phe Pro Pro Met Val Phe Ser His Val 

185 190 195 

Glu Gly Trp Ser Phe Ser Glu Gly Phe Tyr Phe Ala Phe He Thr 

200 205 210 

Leu Ser Thr He Gly Phe Gly Asp Tyr Val Ala Gly Thr Asp Pro 

215 220 
Ser Lys His Tyr He Ser Val Tyr Arg Ser Leu Ala Ala He Trp 

230 235 2 T U 

He Leu Leu Gly Leu Ala Trp Leu Ala Leu He Leu Pro Leu Gly 

245 250 2 

Pro Leu Leu Leu His Arg Cys Cys Gin Leu Trp Leu Leu Ser Arg 

260 2 <55 270 

Gly Leu Gly Val Lys Asp Gly Ala Ala Ser Asp Pro Ser Gly Leu 

275 280 
Pro Arg Pro Gin Lys He Pro He Ser Ala 

290 2 95 

<210> 9 
<211> 384 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7474121CD1 

Met°Glu Val Ser Gly His Pro Gin Ala Arg Arg Cys Cys Pro Glu 
15 1° 15 

Ala Leu Gly Lys Leu Phe Pro Gly Leu Cys Phe Leu Cys Phe Leu 

20 25 , 

Val Thr Tyr Ala Leu Val Gly Ala Val Val Phe Ser Ala He Glu 

35 40 « 

Asp Gly Gin Val Leu Val Ala Ala Asp Asp Gly Glu Phe Glu Lys 

Phe Leu Glu Glu Leu Cys Arg He Leu Asn Cys Ser Glu Thr Val 
~ 65 70 75 

Val Glu Asp Arg Lys Gin Asp Leu Gin Gly His Leu Gin Lys Val 

80 85 
Lys Pro Gin Trp Phe Asn Arg Thr Thr His Trp Ser Phe Leu Ser 
95 100 105 

Ser Leu Phe Phe Cys Cys Thr Val Phe Ser Thr Val Gly Tyr Gly 

110 US I 20 

Tyr He Tyr Pro Val Thr Arg Leu Gly Lys Tyr Leu Cys Met Leu 

125 130 I 35 

Tyr Ala Leu Phe Gly He Pro Leu Met Phe Leu Val Leu Thr Asp 

140 145 I 50 

Thr Gly Asp He Leu Ala Thr He Leu Ser Thr Ser Tyr Asn Arg 

155 160 165 

Phe Arg Lys Phe Pro Phe Phe Thr Arg Pro Leu Leu Ser Lys Trp 

170 175 180 

Cys Pro Lys Ser Leu Phe Lys Lys Lys Pro Asp Pro Lys Pro Ala 

185 190 195 

Asp Glu Ala Val Pro Gin He He He Ser Ala Glu Glu Leu Pro 
P 200 2 05 2 10 

Gly Pro Lys Leu Gly Thr Cys Pro Ser Arg Pro Ser Cys Ser Met 

215 22 0 
Glu Leu Phe Glu Arg Ser His Ala Leu Glu Lys Gin Asn Thr Leu 

230 2 35 240 

Gin Leu Pro Pro Gin Ala Met Glu Arg Ser Asn Ser Cys Pro Glu 
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245 250 255 

Leu Val Leu Gly Arg Leu Ser Tyr Ser He He Ser Asn Leu Asp 

260 265 270 

Glu Val Gly Gin Gin Val Glu Arg Leu Asp He Pro Leu Pro lie 

275 280 285 

He Ala Leu He Val Phe Ala Tyr He Ser Cys Ala Ala Ala lie 

290 295 300 

Leu Pro Phe Trp Glu Thr Gin Leu Asp Phe Glu Asn Ala Phe Tyr 

305 310 315 

Phe Cys Phe Val Thr Leu Thr Thr He Gly Phe Gly Asp Thr Val 

320 325 330 

Leu Glu His Pro Asn Phe Phe Leu Phe Phe Ser He Tyr He lie 

335 340 345 

Val Gly Met Glu He Val Phe He Ala Phe Lys Leu Val Gin Asn 

350 355 360 

Arg Leu He Asp He Tyr Lys Asn Val Met Leu Phe Phe Ala Lys 

365 370 375 

Gly Lys Phe Tyr His Leu Val Lys Lys 

380 

<210> 10 
<211> 769 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 747561SCD1 

<400> 10 mV 
Met Val Ser Pro Lys Met Tyr Leu Ser Thr Glu He Arg Asn Thr 
1 5 10 15 

Phe Arg Leu Pro Ala Pro Gin Thr His Leu Gin Pro Cys Pro Ala 
20 25 30 

Glv Phe Ala His Pro Leu Leu Val Asn Ala Pro Asp Met Ser Gin 
35 40 45 

Pro Arg Pro Arg Tyr Val Val Asp Arg Ala Ala Tyr Ser Leu Thr 
50 55 o° 

Leu Phe Asp Asp Glu Phe Glu Lys Lys Asp Arg Thr Tyr Pro Val 
65 70 75 

Gly Glu Lys Leu Arg Asn Ala Phe Arg Cys Ser Ser Ala Lys He 
80 85 90 

Lvs Ala Val Val Phe Gly Leu Leu Pro Val Leu Ser Trp Leu Pro 
* 95 100 105 

Lys Tyr Lys He Lys Asp Tyr He He Pro Asp Leu Leu Gly Gly 
HO 115 120 

Leu Ser Gly Gly Ser He Gin Val Pro Gin Gly Met Ala Phe Ala 
125 130 135 

Leu Leu Ala Asn Leu Pro Ala Val Asn Gly Leu Tyr Ser Ser Phe 
140 145 150 

Phe Pro Leu Leu Thr Tyr Phe Phe Leu Gly Gly Val His Gin Met 
155 160 165 

Val Pro Gly Thr Phe Ala Val He Ser He Leu Val Gly Asn He 
170 175 180 

Cys Leu Gin Leu Ala Pro Glu Ser Lys Phe Gin Val Phe Asn Asn 
185 190 195 

Ala Thr Asn Glu Ser Tyr Val Asp Thr Ala Ala Met Glu Ala Glu 
200 205 210 

Arg Leu His Val Ser Ala Thr Leu Ala Cys Leu Thr Ala He He 
215 220 225 

Gin Met Gly Leu Gly Phe Met Gin Phe Gly Phe Val Ala He Tyr 
230 235 240 

Leu Ser Glu Ser Phe He Arg Gly Phe Met Thr Ma Ala Gly Leu 
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245 250 255 

Gin He Leu He Ser Val Leu Lys Tyr He Phe Gly Leu Thr lie 
260 265 270 

Pro Ser Tyr Thr Gly Pro Gly Ser He Val Phe Thr Phe He Asp 
275 280 285 

He Cys Lys Asn Leu Pro His Thr Asn He Ala Ser Leu He Phe 
290 295 300 

Ala Leu He Ser Gly Ala Phe Leu Val Leu Val Lys Glu Leu Asn 
305 310 315 

Ala Arg Tyr Met His Lys He Arg Phe Pro He Pro Thr Glu Met 
320 325 330 

He Val Val Val Val Ala Thr Ala He Ser Gly Gly Cys Lys Met 
335 340 345 

Pro Lys Lys Tyr His Met Gin He Val Gly Glu He Gin Arg Gly 
350 355 360 

Phe Pro Thr Pro Val Ser Pro Val Val Ser Gin Trp Lys Asp Met 
365 370 375 

He Gly Thr Ala Phe Ser Leu Ala He Val Ser Tyr Val He Asn 
380 385 390 

Leu Ala Met Gly Arg Thr Leu Ala Asn Lys His Gly Tyr Asp Val 
395 400 405 

Asp Ser Asn Gin Glu Met He Ala Leu Gly Cys Ser Asn Phe Phe 
410 415 42 0 

Gly Ser Phe Phe Lys He His Val He Cys Cys Ala Leu Ser Val 
425 43 0 435 

Thr Leu Ala Val Asp Gly Ala Gly Gly Lys Ser Gin Ser Val Leu 
440 44 5 4:>u 

Gly Ala Leu He Ala Val Asn Leu Lys Asn Ser Leu Lys Gin Leu 
455 4 60 4 »5 

Thr Asp Pro Tyr Tyr Leu Trp Arg Lys Ser Lys Leu Asp Cys Cys 
470 475 480 

He Trp Val Val Ser Phe Leu Ser Ser Phe Phe Leu Ser Leu Pro 
485 4 90 495 

Tyr Gly Val Ala Val Gly Val Ala Phe Ser Val Leu Val Val Val 
500 505 510 

Phe Gin Thr Gin Phe Arg Asn Gly Tyr Ala Leu Ala Gin Val Met 
515 520 52b 

Asp Thr Asp He Tyr Val Asn Pro Lys Thr Tyr Asn Arg Ala Gin 
530 535 540 

Asp He Gin Gly He Lys He He Thr Tyr Cys Ser Pro Leu Tyr 
545 550 ->->-> 

Phe Ala Asn Ser Glu He Phe Arg Gin Lys Val He Ala Lys Thr 
560 565 570 

Val Ser Leu Gin Glu Leu Gin Gin Asp Phe Glu Asn Ala Pro Pro 
575 580 585 

Thr Asp Pro Asn Asn Asn Gin Thr Pro Ala Asn Gly Thr Ser Val 
590 595 600 

Ser Tyr He Thr Phe Ser Pro Asp Ser Ser Ser Pro Ala Gin Ser 
605 610 615 

Glu Pro Pro Ala Ser Ala Glu Ala Pro Gly Glu Pro Ser Asp Met 
620 625 630 

Leu Ala Ser Val Pro Pro Phe Val Thr Phe His Thr Leu He Leu 
635 640 645 

Asp Met Ser Gly Val Ser Phe Val Asp Leu Met Gly He Lys Ala 
650 655 660 

Leu Ala Lys Leu Ser Ser Thr Tyr Gly Lys He Gly Val Lys Val 
665 670 675 

Phe Leu Val Asn He His Ala Gin Val Tyr Asn Asp He Ser Has 
680 685 690 

Gly Gly Val Phe Glu Asp Gly Ser Leu Glu Cys Lys His Val Phe 
y 695 700 705 

Pro Ser He His Asp Ala Val Leu Phe Ala Gin Ala Asn Ala Arg 
710 715 720 
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Asp val Thr Pro Gly His Asn Phe Gin Gly Ala Pro Gly Asp Ala 

725 '• 3U 
Glu Leu Ser Leu Tyr Asp Ser Glu Glu Asp lie Arg Ser Tyr Trp 

740 74b 
Asp Leu Glu Gin Glu Met Phe Gly Ser Met Phe His Ala Glu Thr 

755 

Leu Thr Ala Leu 

<210> 11 
<211> 882 
<212> PRT 

■<213> Homo sapiens 
<220> 

<221> miscJEeature 

<223> Incyte ID No: 7475656CD1 

Met^Glu^Gly Gly Gly Lys Pro Asn Ser Ser Ser Asn Ser Arg Asp 

1 5 10 

Asp Gly Asn Ser Val Phe Pro Ala Lys Ala Ser Ala Pro Gly Ala 
20 

Gly Pro Ala Ala Ala Glu Lys Arg Leu Gly Thr Pro Pro Gly Gly 
35 40 * 3 

Gly Gly Ala Gly Ala Lys Glu His Gly Asn Ser Val Cys Phe Lys 



50 



65 70 

^rg 

80 

Thr Ser Met Leu Gin Pro Gly Val Asn Lys Phe Ser Leu Arg Met 



val Asp Gly Gly Gly Gly Glu Glu Pro Ala Gly Gly Phe Glu Asp 

Ala Glu Gly Pro A^ **g Gln ^ r Gly P £| M6t Gln 

Ljys 
100 

Phe Gly Ser Gln Lys Ala Val Glu Lys Glu Gln Glu Arg Val Lys 

110 lib 
Thr Ala Gly Phe Trp He He His Pro Tyr Ser Asp Phe Arg Phe 
125 130 n 

Tyr Trp Asp Leu He Met Leu lie Met Met Val Gly Asn Leu Val 

140 145 7_ 

lie lie Pro Val Gly lie Thr Phe Phe Thr Glu Gln Thr Thr Thr 

155 160 xu 

Pro Trp lie lie Phe Asn Val Ala Ser Asp Thr Val Phe Leu Leu 

170 17b 
Asp Leu lie Met Asn Phe Arg Thr Gly Thr Val Asn Glu Asp Ser 

185 190 
Ser Glu lie He Leu Asp Pro Lys Val He Lys Met Asn Tyr Leu 

200 205 
Lys Ser Trp Phe Val Val Asp Phe He Ser Ser He Pro Val Asp 

215 

Tyr He Phe Leu He Val Glu Lys Gly Met Asp Ser Glu Val Tyr 

230 235 
Lys Thr Ala Arg Ala Leu Arg He Val Arg Phe Thr Lys He Leu 

245 250 7 

Ser Leu Leu Arg Leu Leu Arg Leu Ser Arg Leu He Arg Tyr He 

260 265 t 

His Gln Trp Glu Glu He Phe His Met Thr Tyr Asp Leu Ala Ser 

Ala Val Val Arg lie Phe Asn Leu He Gly Met Met Leu Leu Leu 

290 295 
Cys His Trp Asp Gly Cys Leu Gln Phe Leu Val Pro Leu Leu Gln 

Asp Phe Pro Pro Asp Cys Trp Val Ser Leu Asn Glu Met Val Asn 
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Asp Ser Trp Gly Lys Gin Tyr Ser Tyr Ala Leu Phe Lys Ala Met 

335 340 
Ser His Met Leu Cys He Gly Tyr Gly Ala Gin Ala Pro Val Ser 
350 355 360 

Met Ser Asp Leu Trp He Thr Met Leu Ser Met He Val Gly Ala 
365 370 375 

Thr Cys Tyr Ala Met Phe Val Gly His Ala Thr Ala Leu He Gin 
380 • 385 390 

Ser Leu Asp Ser Ser Arg Arg Gin Tyr Gin Glu Lys Tyr Lys Gin 
395 400 405 

Val Glu Gin Tyr Met Ser Phe His Lys Leu Pro Ala Asp Met Arg 

410 415 ^ 

. Gin Lys He His Asp Tyr Tyr Glu His Arg Tyr Gin Gly Lys He 
425 4 30 4i3:} 

Phe Asp Glu Glu Asn He Leu Asn Glu Leu Asn Asp Pro Leu Arg 
440 44 5 4 ->° 

Glu Glu He Val Asn Phe Asn Cys Arg Lys Leu Val Ala Thr Met 
455 • 460 465 

Pro Leu Phe Ala Asn Ala Asp Pro Asn Phe Val Thr Ala Met Leu 
470 475 4 80 

Ser Lys Leu Arg Phe Glu Val Phe Gin Pro Gly Asp Tyr He He 
485 490 495 

Arg Glu Gly Ala Val Gly Lys Lys Met Tyr Phe He Gin His Gly 
500 505 510 

Val Ala Gly Val He Thr Lys Ser Ser Lys Glu Met Lys Leu Thr 
515 520 =25 

Asp Gly Ser Tyr Phe Gly Glu He Cys Leu Leu Thr Lys Gly Arg 
530 535 =; 4 " 

Arg Thr Ala Ser Val Arg Ala Asp Thr Tyr Cys Arg Leu Tyr Ser 
545 550 - , - } - > 

Leu Ser Val Asp Asn Phe Asn Glu Val Leu Glu Glu Tyr Pro Met 
560 565 570 

Met Arg Arg Ala Phe Glu Thr Val Ala He Asp Arg Leu Asp Arg 
575 580 585 

He Gly Lys Lys Asn Ser He Leu Leu Gin Lys Phe Gin Lys Asp 
590 595 600 

Leu Asn Thr Gly Val Phe Asn Asn Gin Glu Asn Glu He Leu Lys 
605 610 615 

Gin He Val Lys His Asp Arg Glu Met Val Gin Ala He Ala Pro 
620 625 630 

He Asn Tyr Pro Gin Met Thr Thr Leu Asn Ser Thr Ser Ser Thr 
635 640 645 

Thr Thr Pro Thr Ser Arg Met Arg Thr Gin Ser Pro Pro Val Tyr 
650 655 660 

Thr Ala Thr Ser Leu Ser His Ser Asn Leu His Ser Pro Ser Pro 
665 670 675 

Ser Thr Gin Thr Pro Gin Pro Ser Ala He Leu Ser Pro Cys Ser 
680 685 690 

Tyr Thr Thr Ala Val Cys Ser Pro Pro Val Gin Ser Pro Leu Ala 
695 700 705 

Ala Arg Thr Phe His Tyr Ala Ser Pro Thr Ala Ser Gin Leu Ser 

710 715 
Leu Met Gin Gin Gin Pro Gin Gin Gin Val Gin Gin Ser Gin Pro 
725 73 0 735 

Pro Gin Thr Gin Pro Gin Gin Pro Ser Pro Gin Pro Gin Thr Pro 
740 745 /:}U 

Gly Ser Ser Thr Pro Lys Asn Glu Val His Lys Ser Thr Gin Ala 

755 760 
Leu His Asn Thr Asn Leu Thr Arg Glu Val Arg Pro Leu Ser Ala 
770 775 780 

Ser Gin Pro Ser Leu Pro His Glu Val Ser Thr Leu He Ser Arg 
785 790 795 

. Pro His Pro Thr Val Gly Glu Ser Leu Ala Ser He Pro Gin Pro 
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800 805 810 

Val Thr Ala Val Pro Gly Thr Gly Leu Gin Ala Gly Gly Arg Ser 

815 820 825 

Thr Val Pro Gin Arg Val Thr Leu Phe Arg Gin Met Ser Ser Gly 

830 835 840 

Ala He Pro Pro Asn Arg Gly Val Pro Pro Ala Pro Pro Pro Pro 

845 850 855 

Ala Ala Ala Leu Pro Arg Glu Ser Ser Ser Val Leu Asn Thr Asp 

860 865 870 

Pro Asp Ala Glu Lys Pro Arg Phe Ala Ser Asn Leu 

875 880 

<210> 12 
<211> 1547 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7480632CD1 

Met°val 2 Lys Lys Glu He Ser Val Arg Gin Gin He Gin Ala Leu 
1.5 10 15 

Leu Tyr Lys Asn Phe Leu Lys Lys Trp Arg He Lys Arg Glu Phe 
20 25 30 

Leu Glu Glu Trp Thr He Thr Leu Phe Leu Gly Leu Tyr Leu Cys 
35 40 45 

He Phe Ser Glu His Phe Arg Ala Thr Arg Phe Pro Glu Gin Pro 
50 55 60 

Pro Lys Val Leu Gly Ser Val Asp Gin Phe Asn Asp Ser Gly Leu 
65 70 75 

Val Val Ala Tyr Thr Pro Val Ser Asn He Thr Gin Arg He Met 
80 85 90 

Asn Lys Met Ala Leu Ala Ser Phe Met Lys Gly Arg Thr Val lie 
95 100 105 

Gly Thr Pro Asp Glu Glu Thr Met Asp He Glu Leu Pro Lys Lys 
HO H5 120 

Tyr His Glu Met Val Gly Val He Phe Ser Asp Thr Phe Ser Tyr 
125 130 135 

Arg Leu Lys Phe Asn Trp Gly Tyr Arg He Pro Val He Lys Glu 
140 145 150 

His Ser Glu Tyr Thr Gly His Cys Trp Ala Met His Gly Glu lie 
155 160 165 

Phe Cys Tyr Leu Ala Lys Tyr Trp Leu Lys Gly Phe Val Ala Phe 
170 175 180 

Gin Ala Ala He Asn Ala Ala He He Glu Val Thr Thr Asn His 
185 190 195 

Ser Val Met Glu Glu Leu Thr Ser Val He Gly He Asn Met Lys 
200 205 210 

He Pro Pro Phe He Ser Lys Gly Glu He Met Asn Glu Trp Phe 
215 220 225 

His Phe Thr Cys Leu Val Ser Phe Ser Ser Phe He Tyr Phe Ala 
230 235 240 

Ser Leu Asn Val Ala Arg Glu Arg Gly Lys Phe Lys Lys Leu Met 
245 250 255 

Thr Val Met Gly Leu Arg Glu Ser Ala Phe Trp Leu Ser Trp Gly 
260 265 270 

Leu Thr Tyr He Cys Phe He Phe He Met Ser He Phe Met Ala 
275 280 285 

Leu Val He Thr Ser He Pro He Val Phe His Thr Gly Phe Met 
290 295 300 

Val He Phe Thr Leu Tyr Ser Leu Tyr Gly Leu Ser Leu Val Ala 
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305 310 315 

Leu Ala Phe Leu Met Ser Val Leu He Arg Lys Pro Met Leu Ala 
320 325 - 3JU 

Gly Leu Ala Gly Phe Leu Phe Thr Val Phe Trp Gly Cys Leu Gly 

335 340 
Phe Thr Val Leu Tyr Arg Gin Leu Pro Leu Ser Leu Gly Trp Val 

350 355 
Leu Ser Leu Leu Ser Pro Phe Ala Phe Thr Ala Gly Met Ala Gin 

365 370 
He Thr His Leu Asp Asn Tyr Leu Ser Gly Val He Phe Pro Asp 

380 385 , t 0 

Pro Ser Gly Asp Ser Tyr Lys Met He Ala Thr Phe Phe He Leu 

395 400 4US 

Ala Phe Asp Thr Leu Phe Tyr Leu He Phe Thr Leu Tyr Phe Glu 

410 415 
Arg Val Leu Pro Gly Lys Asp Gly His Gly Asp Ser Pro Leu Phe 

425 430 * J -' 

Phe Leu Lys Ser Ser Phe Trp Ser Lys His Gin Asn Thr His His 

440 445 *= u 

Glu He Phe Glu Asn Glu He Asn Pro Glu His Ser Ser Asp Asp 

455 460 ftOD 

Ser Phe Glu Pro Val Ser Pro Glu Phe His Gly Lys Glu Ala He 

470 475 
Arg He Arg Asn Val He Lys Glu Tyr Asn Gly Lys Thr Gly Lys 

485 490 " D 

Val Glu Ala Leu Gin Gly He Phe Phe Asp He Tyr Glu Gly Gin 

500 505 r 

He Thr Ala He Leu Gly His Asn Gly Ala Gly Lys Ser Thr Leu 

515 ^20 
Leu Asn He Leu Ser Gly Leu Ser Val Ser Thr Glu Gly Ser Ala 

530 535 3ftU 

Thr lie Tyr Asn Thr Gin Leu Ser Glu He Thr Asp Met Glu Glu 

545 550 000 

He Arg Lys Asn He Gly Phe Cys Pro Gin Phe Asn Phe Gin Phe 

560 565 
Asp Phe Leu Thr Val Arg Glu Asn Leu Arg Val Phe Ala Lys He 

575 580 
Lys Gly He Gin Pro Lys Glu Val Glu Gin Glu Val Leu Leu Leu 

590 595 
Asp Glu Pro Thr Ala Gly Leu Asp Pro Phe Ser Arg His Arg Val 

Trp Ser Leu Leu Lys Glu His Lys Val Asp Arg Leu He Leu Phe 

Ser Thr Gin Phe Met Asp Glu Ala Asp He Leu Ala Asp Arg Lys 

635 640 ? 3 

Val Phe Leu Ser Asn Gly Lys Leu Lys Cys Ala Gly Ser Ser Leu 

650 655 660 

Phe Leu Lys Arg Lys Trp Gly He Gly Tyr His Leu Ser Leu Hxs 

665 670 
Arg Asn Glu Met Cys Asp Thr Glu Lys He Thr Ser Leu He Lys 

680 685 690 

Gin His He Pro Asp Ala Lys Leu Thr Thr Glu Ser Glu Glu Lys 

695 700 705 

Leu Val Tyr Ser Leu Pro Leu Glu Lys Thr Asn Lys Phe Pro Asp 

710 715 720 

Leu Tyr Ser Asp Leu Asp Lys Cys Ser Asp Gin Gly He Arg Asn 

725 7 ^0 
Tyr Ala Val Ser Val Thr Ser Leu Asn Glu Val Phe Leu Asn Leu 

740 74 5 
Glu Gly Lys Ser Ala He Asp Glu Pro Asp Phe Asp He Gly Lys 

755 760 765 

Gin Glu Lys He His Val Thr Arg Asn Thr Gly Asp Glu Ser Glu 

770 775 780 
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Met Glu Gin Val Leu Cys Ser Leu Pro Glu Thr Arg Lys Ala Val 

785 790 795 

Ser Ser Ala Ala Leu Trp Arg Arg Gin He Tyr Ala Val Ala Thr 

800 805 810 

Leu Arg Phe Leu Lys Leu Arg Arg Glu Arg Arg Ala Leu Leu Cys 

815 820 825 

Leu Leu Leu Val Leu Gly He Ala Phe He Pro He He Leu Glu 

830 835 840 

Lys He Met Tyr Lys Val Thr Arg Glu Thr His Cys Trp Glu Phe 

845 850 855 

Ser Pro Ser Met Tyr Phe Leu Ser Leu Glu Gin He Pro Lys Thr 

860 865 870 

Pro Leu Thr Ser Leu Leu He Val Asn Asn Thr Gly Ser Asn He 

875 880 885 

Glu Asp Leu Val His Ser Leu Lys Cys Gin Asp He Val Leu Glu 

890 895 900 

He Asp Asp Phe Arg Asn Arg Asn Gly Ser Asp Asp Pro Ser Tyr 

905 910 915 

Asn Gly Ala He He Val Ser Gly Asp Gin Lys Asp Tyr Arg Phe 

920 925 930 

Ser Val Ala Cys Asn Thr Lys Lys Leu Asn Cys Phe Pro Val Leu 

935 940 945 

Met Gly He Val Ser Asn Ala Leu Met Gly He Phe Asn Phe Thr 

950 955 960 

Glu Leu He Gin Met Glu Ser Thr Ser Phe Phe Phe Tyr He Thr 

965 970 975 

Thr Lys Ser Phe Gin Thr Lys He Pro Ser Ser He Pro Ser lie 

980 985 990 

Leu Cys Gin Lys Asn Val Gin Ser Gin Leu Trp He Ser Gly Leu 

995 1000 1005 

Trp Pro Ser Ala Tyr Trp Cys Gly Gin Ala Leu Val Asp He Pro 

1010 1015 1° 20 

Leu Tyr Phe Leu He Leu Phe Ser He His Leu lie Tyr Tyr Phe 

1025 1030 1035 

lie Phe Leu Gly Phe Gin Leu Ser Trp Glu Leu Met Phe Val Leu 

1040 1045 1050 

Val Val Cys He He Gly Cys Ala Val Ser Leu He Phe Leu Thr 

1055 1060 1065 

Tyr Val Leu Ser Phe He Phe Arg Lys Trp Arg Lys Asn Asn Gly 

1070 1075 1080 

Phe Trp Ser Phe Gly Phe Phe He Val Ser He Tyr Thr Asp Phe 

1085 1090 1095 

Ser Phe His Tyr Asn Val Ser Arg Cys Asp Phe Leu Phe He Phe 

1100 H05 H10 

lie Phe Val Cys Leu Phe He Ala His His Phe Ser Phe Cys Ser 

1115 1120 H25 

Pro Tyr Leu Gin Ser Val He Phe Leu Phe Val lie Arg Cys Leu 

1130 1135 H40 

Glu Met Lys Tyr Gly Asn Glu He Met Asn Lys Asp Pro Val Phe 

1145 1150 H55 

Arg He Ser Pro Arg Ser Arg Glu Thr His Pro Asn Pro Glu Glu 

1160 1165 1170 

Pro Glu Glu Glu Asp Glu Asp Val Gin Ala Glu Arg Val Gin Ala 

1175 1180 H85 

Ala Asn Ala Leu Thr Ala Pro Asn Leu Glu Glu Glu Pro Val He 

1190 1195 1200 

Thr Ala Ser Cys Leu His Lys Glu Tyr Tyr Glu Thr Lys Lys Ser 

1205 1210 1215 

Cys Phe Ser Thr Arg Lys Lys Lys He Ala He Arg Asn Val Ser 

1220 1225 1230 

Phe Cys Val Lys Lys Gly Glu Val Leu Gly Leu Leu Gly His Asn 

1235 1240 1245 

Gly Ala Gly Lys Ser Thr Ser He Lys Met He Thr Gly Cys Thr 
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1250 1255 1260 

Lys Pro Thr Ala Gly Val Val Val Leu Gin Gly Ser Arg Ala Ser 
1265 1270 1275 

Val Arg Gin Gin His Asp Asn Ser Leu Lys Phe Leu Gly Tyr Cys 
1280 1285 1290 

Pro Gin Glu Asn Ser Leu Trp Pro Lys Leu Thr Met Lys Glu His 

1295 I 300 13 

Leu Glu Leu Tyr Ala Ala Val Lys Gly Leu Gly Lys Glu Asp Ala 

1310 1315 132 

Ala Leu Ser He Ser Arg Leu Val Glu Ala Leu Lys Leu Gin Glu 
1325 1330 13 35 

Gin Leu Lys Ala Pro Val Lys Thr Leu Ser Glu Gly He Lys Arg 
1340 1345 J-Jsu 

Lys Leu Cys Phe Val Leu Ser He Leu Gly Asn Pro Ser Val Val 
1355 1360 1365 

Leu Leu Asp Glu Pro Phe Thr Gly Met Asp Pro Glu Gly Gin Gin 
1370 1375 1 JB0 

Gin Met Trp Gin He Leu Gin Ala Thr Val Lys Asn Lys Glu Arg 
1385 1390 1395 

Gly Thr Leu Leu Thr Thr His Tyr Met Ser Glu Ala Glu Ala Val 
1400 1405 1410 

Cys Asp Arg Met Ala Met Met Val Ser Gly Thr Leu Arg Cys lie 
1415 1420 1425 

Gly Ser He Gin His Leu Lys Asn Lys Phe Gly Arg Asp Tyr Leu 

1430 1435 
Leu Glu He Lys Met Lys Glu Pro Thr Gin Val Glu Ala Leu His 
1445 1450 1455 

Thr Glu He Leu Lys Leu Phe Pro Gin Ala Ala Trp Gin Glu Arg 
1460 1465 1470 

Tyr Ser Ser Leu Met Ala Tyr Lys Leu Pro Val Glu Asp Val His 
I475 1480 1485 

Pro Leu Ser Arg Ala Phe Phe Lys Leu Glu Ala Met Lys Gin Thr 
1490 1495 "OU 

Phe Asn Leu Glu Glu Tyr Ser Leu Ser Gin Ala Thr Leu Glu Gin 
1505 1510 1515 

Val Phe Leu Glu Leu Cys Lys Glu Gin Glu Leu Gly Asn Val Asp 
1520 1525 1530 

Asp Lys He Asp Thr Thr Val Glu Trp Lys Leu Leu Pro Gin Glu 
I535 1540 

Asp Pro 



<210> 13 

<211> 698 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6952742CD1 

Met°Asp 3 Glu Ser Pro Glu Pro Leu Gin Gin Gly Arg Gly Pro Val 
15 10 15 

Pro Val Arg Arg Gin Arg Pro Ala Pro Arg Gly Leu Arg Glu Met 

20 25 30 

Leu Lys Ala Arg Leu Trp Cys Ser Cys Ser Cys Ser Val Leu Cys 

35 40 *D 

Val Arg Ala Leu Val Gin Asp Leu Leu Pro Ala Thr Arg Trp Leu 

50 55 60 

Arg Gin Tyr Arg Pro Arg Glu Tyr Leu Ala Gly Asp Val Met Ser 

65 70 75 

Gly Leu Val He Gly He He Leu Ala He Ala Tyr Ser Leu Leu 
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80 85 90 

Ala Gly Leu Gin Pro He Tyr Ser Leu Tyr Thr Ser Phe Phe Ala 
95 100 10:5 

Asn Leu He Tyr Phe Leu Met Gly Thr Ser Arg His Val Ser Val 

110 115 1 

Gly lie Phe Ser Leu Leu Cys Leu Met Val Gly Gin Val Val Asp 

125 J-- 33 
Arg Glu Leu Gin Leu Ala Gly Phe Asp Pro Ser Gin Asp Gly Leu 

140 145 
Gin Pro Gly Ala Asn Ser Ser Thr Leu Asn Gly Ser Ala Ala Met 

155 1^0 
Leu Asp Cys Gly Arg Asp Cys Tyr Ala He Arg Val Ala Thr Ala 

17a 1 75 180 

Leu Thr Leu Met Thr Gly Leu Tyr Gin Val Leu Met Gly Val Leu 

185 I 90 195 

Arg Leu Gly Phe Val Ser Ala Tyr Leu Ser Gin Pro Leu Leu Asp 

Gly Phe Ala Met Gly Ala Ser Val Thr He Leu Thr Ser Gin Leu 

215 220 
Lys His Leu Leu Gly Val Arg He Pro Arg His Gin Gly Pro Gly 

230 235 
Met Val Val Leu Thr Trp Leu Ser Leu Leu Arg Gly Ala Gly Gin 

245 250 
Ala Asn Val Cys Asp Val Val Thr Ser Thr Val Cys Leu Ala Val 

260 265 . 270 

Leu Leu Ala Ala Lys Glu Leu Ser Asp Arg Tyr Arg His Arg Leu 

275 280 
Arg Val Pro Leu Pro Thr Glu Leu Leu Val He Val Val Ala Thr 

Leu Val Ser His Phe Gly Gin Leu His Lys Arg Phe Gly Ser Ser 

305 310 315 

Val Ala Gly Asp He Pro Thr Gly Phe Met Pro Pro Gin Val Pro 

320 325 
Glu Pro Arg Leu Met Gin Arg Val Ala Leu Asp Ala Val Ala Leu 

335 340 J<tD 

Ala Leu Val Ala Ala Ala Phe Ser He Ser Leu Ala Glu Met Phe 

350 355 360 

Ala Arg Ser His Gly Tyr Ser Val Arg Ala Asn Gin Glu Leu Leu 

365 370 
Ala Val Gly Cys Cys Asn Val Leu Pro Ala Phe Leu His Cys Phe 

380 385 39 

Ala Thr Ser Ala Ala Leu Ala Lys Ser Leu Val Lys Thr Ala Thr 

395 400 405 

Gly Cys Arg Thr Gin Leu Ser Ser Val Val Ser Ala Thr Val Val 

Leu Leu Val Leu Leu Ala Leu Ala Pro Leu Phe His Asp Leu Gin 

425 430 
Arg Ser Val Leu Ala Cys Val He Val Val Ser Leu Arg Gly Ala 

440 445 4:5U 

Leu Arg Lys Val Trp Asp Leu Pro Arg Leu Trp Arg Met Ser Pro 

Ala Asp Ala Leu Val Trp Ala Gly Thr Val Ala Thr Cys Met Leu 

470 475 480 

Val Ser Thr Glu Ala Gly Leu Leu Ala Gly Val He Leu Ser Leu 

485 490 495. 

Leu Ser Leu Ala Gly Arg Thr Gin Ser His Gly Thr Ala Leu Leu 

500 505 510 

Ala Arg He Gly Asp Thr Ala Phe Tyr Glu Asp Ala Thr Glu Phe 

515 520 
Glu Gly Leu Val Pro Glu Pro Gly Val Arg Val Phe Arg Phe Gly 

530 535 => 4U 

Glv Pro Leu Tyr Tyr Ala Asn Lys Asp Phe Phe Leu Gin Ser Leu 

545 550 555 
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Tyr Ser Leu Thr Gly Leu Asp Ala Gly Cys Met Ala Ala Arg Arg 

Lys Glu Gly Gly Ser Glu Thr Gly Val Gly Glu Gly Gly Pro Ala 

575 580 585 

Gin Gly Glu Asp Leu Gly Pro Val Ser Thr Arg Ala Ala Leu Val 

590 595 600 

Pro Ala Ala Ala Gly Phe His Thr Val Val He Asp Cys Ala Pro 

605 610 615 

Leu Leu Phe Leu Asp Ala Ala Gly Val Ser Thr Leu Gin Asp Leu 

620 625 630 

Arg Arg Asp Tyr Gly Ala Leu Gly He Ser Leu Leu Leu Ala Cys 

635 640 o4b 

Cys Ser Pro Pro Val Arg Asp He Leu Ser Arg Gly Gly Phe Leu 

650 655 660 

Gly Glu Gly Pro Gly Asp Thr Ala Glu Glu Glu Gin Leu Phe Leu 

665 670 675 

Ser Val His Asp Ala Val Gin Thr Ala Arg Ala Arg His Arg Glu 

680 685 690 

Leu Glu Ala Thr Asp Ala His Leu 

695 

<210> 14 

<211> 766 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7478795CD1 

Met°Arg 4 Leu Trp Lys Ala Val Val Val Thr Leu Ala Phe Met Ser 
1 5 10 15 

Val Asp He Cys Val Thr Thr Ala He Tyr Val Phe Ser His Leu 
20 25 30 

Asp Arg Ser Leu Leu Glu Asp He Arg His Phe Asn He Phe Asp 

Ser Val Leu Asp Leu Trp Ala Ala Cys Leu Tyr Arg Ser Cys Leu 
50 55 60 

Leu Leu Gly Ala Thr He Gly Val Ala Lys Asn Ser. Ala Leu Gly 
65 70 75 

Pro Arg Arg Leu Arg Ala Ser Trp Leu Val He Thr Leu Val Cys 
80 85 ■ 5,0 

Leu Phe Val Gly He Tyr Ala Met Val Lys Leu Leu Leu Phe Ser 
95 100 105 

Glu Val Arg Arg Pro He Arg Asp Pro Trp Phe Trp Ala Leu Phe 

110 115 120 

Val Trp Thr Tyr He Ser Leu Gly Ala Ser Phe Leu Leu Trp Trp 

125 130 135 

Leu Leu Ser Thr Val Arg Pro Gly Thr Gin Ala Leu Glu Pro Gly 

140 145 150 

Ala Ala Thr Glu Ala Glu Gly Phe Pro Gly Ser Gly Arg Pro Pro 

155 160 165 

Pro Glu Gin Ala Ser Gly Ala Thr Leu Gin Lys Leu Leu Ser Tyr 

170 175 180 

Thr Lys Pro Asp Val Ala Phe Leu Val Ala Ala Ser Phe Phe Leu 

185 190 195 

He Val Ala Ala Leu Gly Glu Thr Phe Leu Pro Tyr Tyr Thr Gly 
~ 200 205 210 

Arg Ala He Asp Gly He Val He Gin Lys Ser Met Asp Gin Phe 

215 220 225 

Ser Thr Ala Val Val He Val Cys Leu Leu Ala He Gly Ser Ser 

230 235 240 
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Phe Ala Ala Gly lie Arg Gly Gly He Phe Thr Leu He Phe Ala 
245 ^50 "° 

Arg Leu Asn He Arg Leu Arg Asn Cys Leu Phe Arg Ser Leu Val 
260 265 270 

Ser Gin Glu Thr Ser Phe Phe Asp Glu Asn Arg Thr Gly Asp Leu 
275 280 

He Ser Arg Leu Thr Ser Asp Thr Thr Met Val Ser Asp Leu Val 

290 295 
Ser Gin Asn He Asn Val Phe Leu Arg Asn Thr Val Lys Val Thr 
305 310 •J" 

Gly Val Val Val Phe Met Phe Ser Leu Ser Trp Gin Leu Ser Leu 
320 325 330 

Val Thr Phe Met Gly Phe Pro He He Met Met Val Ser Asn He 
335 340 i4b 

Tyr Gly Lys Tyr Tyr Lys Arg Leu Ser Lys Glu Val Gin Asn Ala 
350 355 360 

Leu Ala Arg Ala Ser Asn Thr Ala Glu Glu Thr He Ser Ala Met 
365 370 ■ 3/3 

Lys Thr Val Arg Ser Phe Ala Asn Glu Glu Glu Glu Ala Glu Val 
380 385 J»u 

Tyr Leu Arg Lys Leu Gin Gin Val Tyr Lys Leu Asn Arg Lys Glu 
395 400 4UD 

Ala Ala Ala Tyr Met Tyr Tyr Val Trp Gly Ser Gly Leu Thr Leu 

410 415 „ 

Leu Val Val Gin Val Ser He Leu Tyr Tyr Gly Gly His Leu Val 
425 *30 4 - J - 3 

lie Ser Gly Gin Met Thr Ser Gly Asn Leu He Ala Phe He He 
440 445 , 

Tyr Glu Phe Val Leu Gly Asp Cys Met Glu Ser Val Gly Ser Val 
* 455 460 465 

Tyr Ser Gly Leu Met Gin Gly Val Gly Ala Ala Glu Lys Val Phe 

470 475 
Glu Phe He Asp Arg Gin Pro Thr Met Val His Asp Gly Ser Leu 

485 490 
Ala Pro Asp His Leu Glu Gly Arg Val Asp Phe Glu Asn Val Thr 
500 505 - )J - U 

Phe Thr Tyr Arg Thr Arg Pro His Thr Gin Val Leu Gin Asn Val 

515 ^20 
Ser Phe Ser Leu Ser Pro Gly Lys Val Thr Ala Leu Val Gly Pro 
530 535 D4fcU 

Ser Gly Ser Gly Lys Ser Ser Cys Val Asn He Leu Glu Asn Phe 
545 550 sod 

Tyr Pro Leu Glu Gly Gly Arg Val Leu Leu Asp Gly Lys Pro He 
560 565 3/u 

Ser Ala Tyr Asp His Lys Tyr Leu His Arg Val He Ser Leu Val 
575 580 585 

Ser Gin Glu Pro Val Leu Phe Ala Arg Ser He Thr Asp Asn He 

590 595 
Ser Tyr Gly Leu Pro Thr Val Pro Phe Glu Met Val Val Glu Ala 

605 610 
Ala Gin Lys Ala Asn Ala His Gly Phe He Met Glu Leu Gin Asp 
620 625 "° 

Gly Tyr Ser Thr Glu Thr Gly Glu Lys Gly Ala Gin Leu Ser Gly 

635 640 
Gly Gin Lys Gin Arg Val Ala Met Ala Arg Ala Leu Val Arg Asn 
650 655 660 

Pro Pro Val Leu He Leu Asp Glu Ala Thr Ser Ala Leu Asp Ala 

665 670 6 

Glu Ser Glu Tyr Leu He Gin Gin Ala He His Gly Asn Leu Gin 

680 685 69 

Lys His Thr Val Leu He He Ala His Arg Leu Ser Thr Val Glu 
695 700 705 

His Ala His Leu He Val Val Leu Asp Lys Gly Arg Val Val Gin 
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710 715 • 720 

Gin Gly Thr His Gin Gin Leu Leu Ala Gin Gly Gly Leu Tyr Ala 

725 730 
Lys Leu Val Gin Arg Gin Met Leu Gly Leu Gin Pro Ala Ala Asp 

740 7 ^ 5 
Phe Thr Ala Gly His Asn Glu Pro Val Ala Asn Gly Ser His Lys 

755 760 /OD 

Ala 



<210> 15 

<211> 450 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> miscJEeature 

<223> Incyte ID No: 656293CD1 

Me2°GlJ\eu Arg Ser His His Leu Ser Leu Gly Leu Leu Leu Leu 

- 5 10 

Phe Leu Leu Pro Ala Glu Cys Leu Gly Ala Glu Gly Arg Leu Ala 

20 25 Dyj 

Leu Lys Leu Phe Arg Asp Leu Phe Ala Asn Tyr Thr Ser Ala Leu 

35 40 * 3 

Arg Pro Val Ala Asp Thr Asp Gin Thr Leu Asn Val Thr Leu Glu 

50 

Val Thr Leu Ser Gin lie lie Asp Met Asp Glu Arg Asn Gin Val 

65 70 
Leu Thr Leu Tyr Leu Trp lie Arg Gin Glu Trp Thr Asp Ala Tyr 

80 

Leu' Arg Trp Asp Pro Asn Ala Tyr Gly Gly Leu Asp Ala lie Arg 
lie Pro Ser Ser Leu Val Trp Arg Pro Asp He Val Leu Tyr Asn 

no 

Lys Ala Asp Ala Gin Pro Pro Gly Ser Ala Ser Thr Asn Val Val 

125 130 
Leu Arg His Asp Gly Ala Val Arg Trp Asp Ala Pro Ala He Thr 

140 * 
Arg Ser Ser Cys Arg Val Asp Val Ala Ala Phe Pro Phe Asp Ala 
155 160 X03 

Gin His Cys Gly Leu Thr Phe Gly Ser Trp Thr His Gly Gly His 
170 175 iOU 

Gin Leu Asp Val Arg Pro Arg Gly Ala Ala Ala Ser Leu Ala Asp 

185 190 
Phe Val Glu Asn Val Glu Trp Arg Val Leu Gly Met Pro Ala Arg 
200 205 * 1K > 

Arg Arg Val Leu Thr Tyr Gly Cys Cys Ser Glu Pro Tyr Pro Asp 
215 220 ? 

Val Thr Phe Thr Leu Leu Leu Arg Arg Arg Ala Ala Ala Tyr Val 

230 235 
Cys Asn Leu Leu Leu Pro Cys Val Leu He Ser Leu Leu Ala Pro 

245 250 
Leu Ala Phe His Leu Pro Ala Asp Ser Gly Glu Lys Val Ser Leu 

260 265 t 

Gly Val Thr Val Leu Leu Ala Leu Thr Val Phe Gin Leu Leu Leu 

275 280 
Ala Glu Ser Met Pro Pro Ala Glu Ser Val Pro Leu He Gly Lys 

290 295 
Tyr Tyr Met Ala Thr Met Thr Met Val Thr Phe Ser Thr Ala Leu 

305 310 
Thr He Leu He Met Asn Leu His Tyr Cys Gly Pro Ser Val Arg 
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Pro Val 
Arg Gly 
Arg Pro 
Ala Gly 
Arg Gin 
Phe Arg 
Arg Leu 
Ser Met 



320 

Pro Ala Trp 
335 

Leu Cys Val 
350 

Pro Glu Leu 
365 

Pro Pro Ala 
380 

Glu Ala Leu 
395 

Ser His Arg 
410 

Ala Arg Val 
425 

Ala Leu Val 
440 



Ala Arg 
Arg Glu 
Ser Pro 
Gly Pro 
Leu His 
Ala Ala 
Met Asp 
Met Ser 



325 

Ala Leu Leu 
340 

Arg Gly Glu 
355 

Ser Pro Gin 
370 

Cys His Glu 
385 

His Val Ala 
400 

Gin Arg Cys 
415 

Arg Phe Phe 
430 

Leu Leu Val 
445 



Leu Gly 
Pro Cys 
Ser Pro 
Pro Arg 
Thr He 
His Glu 
Leu Ala 
Leu Val 



330 

His Leu Ala 
345 

Gly Gin Ser 
360 

Glu Gly Gly 
375 

Cys Leu Cys 
390 

Ala Asn Thr 
405 

Asp Trp Lys 
420 

He Phe Phe 
435 

Gin Ala Leu 
450 



<210> 16 

<211> 260 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7473957CD1 

Met°Pro 6 Ile Leu Ala Asn Leu Pro Gly Met Ser Ser Pro Arg Ala 

1 5 10 15 

Met Glu Phe Thr Ser Ser Gly Ser Ala Asn Thr Glu Thr Thr Lys 

20 25 30 

Val Thr Gly Ser Leu Glu Thr Lys Tyr Arg Trp Thr Glu Tyr Gly 

35 40 45 
Leu Thr Phe Thr Glu Lys Trp Asn Thr Asp Asn Thr Leu Gly Thr 

Glu He Thr Val Glu Asp Gin Leu Ala Arg Gly Leu Lys Leu Thr 
65 70 75 

Phe Asp Ser Ser Phe Ser Pro Asn Thr Gly Lys Lys Asn Ala Lys 
80 85 90 

He Lys Thr Gly Tyr Lys Arg Glu His He Asn Leu Gly Cys Asp 
95 100 105 

Met Asp Phe Asp He Ala Gly Pro Ser He Arg Gly Ala Leu Val 

110 11 5 xzu 

Leu Gly Tyr Glu Gly Trp Leu Ala Gly Tyr Gin Met Asn Phe Glu 

125 130 i- 3 - 5 

Thr Ala Lys Ser Arg Val Thr Gin Ser Asn Phe Ala Val Gly Tyr 

140 I 45 „, lZ 

Lys Thr Asp Glu Phe Gin Leu His Thr Asn Val Asn Asp Gly Thr 

155 I 50 1 

Glu Phe Gly Gly Ser lie Tyr Gin Lys Val Asn Lys Lys Leu Glu 

170 175 280 

Thr Ala Val Asn Leu Ala Trp Thr Ala Gly Asn Ser Asn Thr Arg 

185 190 " 5 

Phe Gly He Ala Ala Lys Tyr Gin He Asp Pro Asp Ala Cys Phe 

200 205 210 

Ser Ala Lys Val Asn Asn Ser Ser Leu lie Gly Leu Gly Tyr Thr 

215 220 225 

Gin Thr Leu Lys Pro Gly He Lys Leu Thr Leu Ser Ala Leu Leu 

230 235 240 

Asp Gly Lys Asn Val Asn Ala Gly Gly His Lys Leu Gly Leu Gly 

245 250 ^ 
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Leu Glu Phe Gin Ala 
260 

<210> 17 

<211> 506 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7474111CD1 

<400> 17 

Met Ser Glu Pro Glu Leu Gly Ser Gly Gin Phe Leu Glu Lys Ala 
I 5 10 15 

Leu Gin Thr Pro Ser Val Pro Ala Pro Glu Ser Thr Leu Gly Phe 

20 25 30 

Glu Pro Gly Leu Leu Lys Gly Ala Leu Gly Thr Ala Gin Phe He 
35 40 45 

Pro Met Ala Gin Gly Arg Thr Arg Glu Gin Ala Ser Arg Arg Trp 
50 55 60 

Ala Pro Arg Ser Pro Ala Leu Arg Thr Pro Pro Arg His Tyr Gly 
65 70 75 

Pro Glu Arg Arg Gly Arg Thr Ala Ser Arg Gly Gly Glu Pro Glu 
80 85 90 

Val Gin Gly Gly Ala Pro Gly Asn Pro Ser Pro Ser Lys Pro Gly 
95 100 105 

Ser Pro Gin Gly Val Gly Pro Ala Ala Trp Glu Arg Ala Pro Arg 

110 H5 120 

Pro Arg Cys Ala Gin Pro Ser Gly Ala Arg Val Gly Glu Arg Thr 

125 130 135 

Gin Pro Arg Ser Gin Pro Val Gly Leu Ser Arg Gly Ala Gly Glu 

140 145 150 

Asp Ser Pro Ala Thr Arg Ser Gly Ala Ala Ser Val Val Leu Asn 

155 160 165 

Val Gly Gly Ala Arg Tyr Ser Leu Ser Arg Glu Leu Leu Lys Asp 

170 175 180 

Phe Pro Leu Arg Arg Val Ser Arg Leu His Gly Cys Arg Ser Glu 

185 190 195 

Arg Asp Val Leu Glu Val Cys Asp Asp Tyr Asp Arg Glu Arg Asn 

200 205 210 

Glu Tyr Phe Phe Asp Arg His Ser Glu Ala Phe Gly Phe He Leu 

215 220 225 

Leu Tyr Ala Ala Pro Ser Arg Arg Trp Leu Glu Arg Met Arg Arg 

230 235 240 

Thr Phe Glu Glu Pro Thr Ser Ser Leu Ala Ala Gin He Leu Ala 

245 250 255 

Ser Val Ser Val Val Phe Val He Val Ser Met Val Val Leu Cys 

260 265 270 

Ala Ser Thr Leu Pro Asp Trp Arg Asn Ala Ala Ala Asp Asn Arg 

275 280 285 

Ser Leu Asp Asp Arg Ser Arg He He Glu Ala He Cys He Gly 

290 295 300 

Trp Phe Thr Ala Glu Cys He Val Arg Phe He Val Ser Lys Asn 

305 310 315 

Lys Cys Glu Phe Val Lys Arg Pro Leu Asn He He Asp Leu Leu 

320 ~~ 325 330 

Ala He Thr Pro Tyr Tyr He Ser Val Leu Met Thr Val Phe Thr 

335 340 345 

Gly Glu Asn Ser Gin Leu Gin Arg Ala Gly Val Thr Leu Arg Val 

350 355 360 

Leu Arg Met Met Arg He Phe Trp Val He Lys Leu Ala Arg His 

365 370 375 
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Phe 


lie 


Gly Leu Gin Thr 






380 


Arg 


Glu 


Met Val Met Leu 




395 


lie 


Phe 


Ser Ala Leu Ser 






410 


Glu 


Thr 


Ser Asn Lys Asp 






425 


Trp 


Val 


He He Ser Met 




440 


Pro 


He 


Thr Val Pro Gly 






455 


Ser 


Gly 


He Val Leu Leu 




470 


Ser 


Phe 


Val Gin Cys Tyr 






485 


Tyr 


Ser 


Arg Ser Leu Ser 






500 



WO 02/12340 PCTAJS01/24217 



385 390 
x Val Phe He Cys Val Ala Met Ala 
400 405 
a Leu Leu Glu His Gly Leu Asp Leu 
415 420 
e Thr Ser He Pro Ala Ala Cys Trp 
430 435 
r Thr Val Gly Tyr Gly Asp Met Tyr 
445 450 
g He Leu Gly Gly Val Cys Val Val 
460 465 
a Leu- Pro He- Thr Phe He Tyr His 
475 480 
s Glu Leu Lys Phe Arg Ser Ala Arg 
490 495 
r Glu Phe Leu Asn 
505 

<210> 18 
<211> 506 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7480826CD1 

<400> 18 

Met Lys Lys Ala Glu Met Gly Arg- Phe Ser He Ser Pro Asp Glu 
15 10 I 5 

Asp Ser Ser Ser Tyr Ser Ser Asn Ser Asp Phe Asn Tyr Ser Tyr 
20 25 30 

Pro Thr Lys Gin Ala Ala Leu Lys Ser His Tyr Ala Asp Val Asp 
35 40 45 

Pro Glu Asn Gin Asn Phe Leu Leu Glu Ser Asn Leu Gly Lys Lys 
50 55 60 

Lys Tyr Glu Thr Glu Phe His Pro Gly Thr Thr Ser Phe Gly Met 
65 70 75 

Ser Val Phe Asn Leu Ser Asn Ala He Val Gly Ser Gly He Leu 
80 85 90 

Gly Leu Ser Tyr Ala Met Ala Asn Thr Gly He Ala Leu Phe He 
95 100 105 

He Leu Leu Thr Phe Val Ser He Phe Ser Leu Tyr Ser Val His 
HO 115 120 

Leu Leu Leu Lys Thr Ala Asn Glu Gly Gly Ser Leu Leu Tyr Glu 
125 130 135 

Gin Leu Gly Tyr Lys Ala Phe Gly Leu Val Gly Lys Leu Ala Ala 
140 145 150 

Ser Gly Ser He Thr Met Gin Asn He Gly Ala Met Ser Ser Tyr 
155 160 165 

Leu Phe He Val Lys Tyr Glu Leu Pro Leu Val He Gin Ala Leu 
170 175 180 

Thr Asn He Glu Asp Lys Thr Gly Leu Trp Tyr Leu Asn Gly Asn 
185 190 195 

Tyr Leu Val Leu Leu Val Ser Leu Val Val He Leu Pro Leu Ser 
200 205 210 

Leu Phe Arg Asn Leu Gly Tyr Leu Gly Tyr Thr Ser Gly Leu Ser 
215 220 225 

Leu Leu Cys Met Val Phe Phe Leu He Val Val He Cys Lys Lys 
230 235 240 

Phe Gin Val Pro Cys Pro Val Glu Ala Ala Leu He He Asn Glu 
245 250 255 
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Thr lie Asn Thr Thr Leu Thr Gin Pro Thr Ala Leu Val Pro Ala 
260 265 270 

Leu Ser His Asn Val Thr Glu Asn Asp Ser Cys Arg Pro His Tyr 

275 280 285 

Phe He Phe Asn Ser Gin Thr Val Tyr Ala Val Pro He Leu He 

290 295 300 

Phe Ser Phe Val Cys His Pro Ala Val Leu Pro He Tyr Glu Glu 

305 310 315 

Leu Lys Asp Arg Ser Arg Arg Arg Met Met Asn Val Ser Lys He 

320 325 330 

Ser Phe Phe Ala Met Phe Leu Met Tyr Leu Leu Ala Ala Leu Phe 

335 340 345 

Gly Tyr Leu Thr Phe Tyr Glu His Val Glu Ser Glu Leu Leu His 

350 355 360 

Thr Tyr Ser Ser He Leu Gly Thr Asp He Leu Leu Leu He Val 

365 370 375 

Arg Leu Ala Val Leu Met Ala Val Thr Leu Thr Val Pro Val Val 

380 385 390 

He Phe Pro He Arg Ser Ser Val Thr His Leu Leu Cys Ala Ser 

395 400 405 

Lys Asp Phe Ser Trp Trp Arg His Ser Leu He Thr Val Ser He 

410 415 420 

Leu Ala Phe Thr Asn Leu Leu Val He Phe Val Pro Thr He Arg 

425 430 435 

Asp He Phe Gly Phe He Gly Ala Ser Ala Ala Ser Met Leu He 

440 445 450 

Phe He Leu Pro Ser Ala Phe Tyr He Lys Leu Val Lys Lys Glu 

455 460 465 

Pro Met Lys Ser Val Gin Lys He Gly Ala Leu Phe Phe Leu Leu 

470 475 480 

Ser Gly Val Leu Val Met Thr Gly Ser Met Ala Leu He Val Leu 

485 490 495 

Asp Trp Val His Asn Ala Pro Gly Gly Gly His 

500 505 

<210> 19 
<2il> 315 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6025572CD1 



10 15 
Leu Ala Gly Gly Val Ala 

25 30 
Pro He Glu Arg Val Lys 

40 45 
Lys Gin He Ser Pro Glu 

55 60 
Leu Val Arg He Pro Arg 

70 75 
Gly Asn Leu Ala Asn Val 

85 90 
Asn Phe Ala Phe Lys Asp 
100 105 
Val Asn Lys Glu Lys Gin 
115 120 
Ala Ser Gly Gly Ala Ala 
130 135 



<400> 19 








Met 


His 


Arg 


Glu 


Pro 


Ala 


1 








5 




Asp 


Ala 


Ser 


Ser 


Phe Gly 








20 




Ala 


Ala 


Val 


Ser 


Lys 
35 


Thr 


Leu 


Leu 


Leu 


Gin 


Val 
50 


Gin 


Ala 


Arg 


Tyr 


Lys 


Gly Met 










65 




Glu 


Gin 


Gly 


Phe 


Phe 
80 


Ser 


He 


Arg 


Tyr 


Phe 


Pro 
95 


Thr 


Lys 


Tyr 


Lys 


Gin 


Leu 
110 


Phe 


Phe 


Trp 


Arg 


Trp 


Phe 
125 


Leu 
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Glv Ala Thr Ser Leu Cys Val Val Tyr Pro Leu Asp Phe Ala Arg 

140 145 150 

Thr Arg Leu Gly Val Asp lie Gly Lys Gly Pro Glu Glu Arg Gin 

155 160 165 

Phe Lys Gly Leu Gly Asp Cys He Met Lys He Ala Lys Ser Asp 

170 * 175 180 

Gly He Ala Gly Leu Tyr Gin Gly Phe Gly Val Ser Val Gin Gly 

185 190 195 

He He Val Tyr Arg Ala Ser Tyr Phe Gly Ala Tyr Asp Thr Val 

200 205 210 

Lys Gly Leu Leu Pro Lys Pro Lys Lys Thr Pro Phe Leu Val Ser 

215 220 225 

Phe Phe He Ala Gin Val Val Thr Thr Cys Ser Gly He Leu Ser 

230 235 240 

Tvr Pro Phe Asp Thr Val Arg Arg Arg Met Met Met Gin Ser Gly 

245 250 255 

Glu Ala Lys Arg Gin Tyr Lys Gly Thr Leu Asp Cys Phe Val Lys 

260 265 270 

He Tyr Gin His Glu Gly He Ser Ser Phe Phe Arg Gly Ala Phe 

275 280 285 

Ser Asn Val Leu Arg Gly Thr Gly Gly Ala Leu Val Leu Val Leu 

290 295 300 

Tyr Asp Lys He Lys Glu Phe Phe His He Asp He Gly Gly Arg 

305 310 315 



<210> 20 
<211> 540 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5686561CD1 

<400> 20 „ 
Met Val Pro Ala Gly Trp Val Arg Gly Leu Glu Leu Ser Leu Trp 
! 5 10 15 

Gly Gly Asp Pro Val Val Pro Trp Ser Cys Arg Phe Cys Ser Gin 
20 25 30 

Gin Asp Asp Gly Gin Asp Arg Glu Arg Leu Thr Tyr Phe Gin Asn 
35 40 45 

Leu Pro Glu Ser Leu Thr Ser Leu Leu Val Leu Leu Thr Thr Ala 
50 55 60 

Asn Asn Pro Asp Val Met He Pro Ala Tyr Ser Lys Asn Arg Ala 
65 70 75 

Tyr Ala He Phe Phe He Val Phe Thr Val He Gly Ser Leu Phe 

80 85 90 . 

Leu Met Asn Leu Leu Thr Ala He He Tyr Ser Gin Phe Arg Gly 
95 100 105 

Tvr Leu Met Lys Ser Leu Gin Thr Ser Leu Phe Arg Arg Arg Leu 
110 H5 120 

Glv Thr Arg Ala Ala Phe Glu Val Leu Ser Ser Met Val Gly Glu 
125 130 135 

Gly Gly Ala Phe Pro Gin Ala Val Gly Val Lys Pro Gin Asn Leu 
140 145 150 

Leu Gin Val Leu Gin Lys Val Gin Leu Asp Ser Ser His Lys Gin 
155 160 165 

Ala Met Met Glu Lys Val Arg Ser Tyr Asp Ser Val Leu Leu Ser 
170 175 180 

Ala Glu Glu Phe Gin Lys Leu Phe Asn Glu Leu Asp Arg Ser Val 
185 190 195 

Val Lys Glu His Pro Pro Arg Pro Glu Tyr Gin Ser Pro Phe Leu 
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200 205 210 

Gin Ser Ala Gin Phe Leu Phe Gly His Tyr Tyr Phe Asp Tyr Leu 

215 220 225 

Gly Asn Leu lie Ala Leu Ala Asn Leu Val Ser He Cys Val Phe 

230 235 240 

Leu Val Leu Asp Ala Asp Val Leu Pro Ala Glu Arg Asp Asp Phe 

245 250 255 

He Leu Gly He Leu Asn Cys Val Phe He Val Tyr Tyr Leu Leu 

260 265 270 

Glu Met Leu Leu Lys Val Phe Ala Leu Gly Leu Arg Gly Tyr Leu 

275 280 285 

Ser Tyr Pro Ser Asn Val Phe Asp Gly Leu Leu Thr Val Val Leu 

290 295 300 

Leu Val Leu Glu He Ser Thr Leu Ala Val Tyr Arg Leu Pro His 

305 310 315 

Pro Gly Trp Arg Pro Glu Met Val Gly Leu Leu Ser Leu Trp Asp 

320 325 330 

Met Thr Arg Met Leu Asn Met Leu He Val Phe Arg Phe Leu Arg 

335 340 345 

He He Pro Ser Met Lys Pro Met Ala Val Val Ala Ser Thr Val 

350 355 360 

Leu Gly Leu Val Gin Asn Met Arg Ala Phe Gly Gly He Leu Val 

365 370 375 

Val Val Tyr Tyr Val Phe Ala He He Gly He Asn Leu Phe Arg 

380 385 390 

Gly Val He Val Ala Leu Pro Gly Asn Ser Ser Leu Ala Pro Ala 

395 400 405 

Asn Gly Ser Ala Pro Cys Gly Ser Phe Glu Gin Leu Glu Tyr Trp 

410 415 420 

Ala Asn Asn Phe Asp Asp Phe Ala Ala Ala Leu Val Thr Leu Trp 

425 430 435 

Asn Leu Met Val Val Asn Asn Trp Gin* Val Phe Leu Asp Ala Tyr 

440 445 450 

Arg Arg Tyr Ser Gly Pro Trp Ser Lys He Tyr Phe Val Leu Trp 

455 460 465 

Trp Leu Val Ser Ser Val He Trp Val Asn Leu Phe Leu Ala Leu 
470 475 480 

He Leu Glu Asn Phe Leu His Lys Trp Asp Pro Arg Ser His Leu 
485 490 495 

Gin Pro Leu Ala Gly Thr Pro Glu Ala Thr Tyr Gin Met Thr Val 
500 505 510 

Glu Leu Leu Phe Arg Asp He Leu Glu Glu Pro Gly Glu Asp Glu 
515 520 525 

Leu Thr Glu Arg Leu Ser Gin His Pro His Leu Trp Leu Cys Arg 
530 535 540 



<210> 21 

<211> 322 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1553725CD1 

<400> 21 

Met Glu Ala Asp Leu Ser Gly Phe Asn He Asp Ala Pro Arg Trp 
1 5 10 " 15 

Asp Gin Arg Thr Phe Leu Gly Arg Val Lys His Phe Leu Asn He 
. 20 25 30 

Thr Asp Pro Arg Thr Val Phe Val Ser Glu Arg Glu Leu Asp Trp 
35 40 45 
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Ala Lys Val Met Val Glu Lys Ser Arg Met Gly Val Val Pro Pro 

50 55 60 

Gly Thr Gin Val Glu Gin Leu Leu Tyr Ala Lys Lys Leu Tyr Asp 

65 70 75 

Ser Ala Phe His Pro Asp Thr Gly Glu Lys Met Asn Val lie Gly 

80 85 90 

Arg Met Ser Phe Gin Leu Pro Gly Gly Met He He Thr Gly Phe 

95 100 105 

Met Leu Gin Phe Tyr Arg Thr Met Pro Ala Val He Phe Trp Gin 

110 ~ 115 120 

Trp Val Asn Gin Ser Phe Asn Ala Leu Val Asn Tyr Thr Asn Arg 

125 130 135 

Asn Ala Ala Ser Pro Thr Ser Val Arg Gin Met Ala Leu Ser Tyr 

140 145 150 

Phe Thr Ala Thr Thr Thr Ala Val Ala Thr Ala Val Gly Met Asn 

155 160 165 

Met Leu Thr Lys Lys Ala Pro Pro Leu Val Gly Arg Trp Val Pro ' 

170 175 180 

Phe Ala Ala Val Ala Ala Ala Asn Cys Val Asn He Pro Met Met 

185 190 195 

Arg Gin Gin Glu Leu He Lys Gly He Cys Val Lys Asp Arg Asn 

200 205 210 

Glu Asn Glu He Gly His' Ser Arg Arg Ala Ala Ala He Gly He 

215 220 225 

Thr Gin Val Val He Ser Arg He Thr Met Ser Ala Pro Gly Met 

230 235 240 

He Leu Leu Pro Val He Met Glu Arg Leu Glu Lys Leu His Phe 

245 250 255 

Met Gin Lys Val Lys Val Leu His Ala Pro Leu Gin Val Met Leu 

260 265 270 

Ser Gly Cys Phe Leu He Phe Met Val Pro Val Ala Cys Gly Leu 

275 280 285 

Phe Pro Gin Lys Cys Glu Leu Pro Val Ser Tyr Leu Glu Pro Lys 

290 295 300 

Leu Gin Asp Thr He Lys Ala Lys Tyr Gly Glu Leu Glu Pro Tyr 

305 310 315 

Val Tyr Phe Asn Lys Gly Leu 

320 

<210> 22 
<211> 417 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1695770CD1 

<400> 22 

rtvr Leu val Pro Ala Thr Leu Ser Phe Leu Leu Leu Trp 

10 15 
u Arg Val Ala Leu Ala Lys Glu 
25 30 
y Ser Gin Pro Met Ser Pro Ser 
40 45 
y Arg Thr Ser Gly Tyr Asp Ala 
55 60 
y Pro Pro Val Asn Val Thr Cys 
70 75 
r Ser Val Thr Lys Thr Thr Met 
85 90 
u Arg Gin Gin Trp Asn Asp Pro 
95 100 105 



Met 


Thr 


Thr 


Leu 


Val Pro 


Ala 


1 








5 




Thr 


Leu 


Pro 


Gly 


Gin Val 


Leu 










20 




Glu 


Val 


Lys 


Ser 


Gly Thr 


Lys 










35 




Asp 


Phe 


Leu 


ASP 


Lys Leu 


Met 










50 




Arg 


He 


Arg 


Pro 


Asn Phe 


Lys 










65 




Asn 


He 


Phe 


He 


Asn Ser 


Phe 










80 




Asp 


Tyr 


Arg 


Val 


Asn Val 


Phe 
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Arg Leu Ser Tyr Arg Glu Tyr Pro Asp Asp Ser Leu Asp Leu Asp 
110 115 120 

Pro Ser Met Leu Asp Ser He Trp Lys Pro Asp Leu Phe Phe Ala 

Asn Glu Lys Gly Ala Asn Phe His Glu Val Thr Thr Asp Asn Lys 

140 145 150 

Leu Leu Arg He Phe Lys Asn Gly Asn Val Leu Tyr Ser He Arg 

155 160 165 

Leu Thr Leu He Leu Ser Cys Leu Met Asp Leu Lys Asn. Phe Pro 

170 175 180 

Met Asp He Gin Thr Cys Thr Met Gin Leu Glu Ser Phe Gly Tyr 

185 190 195 

Thr Met Lys Asp Leu Val Phe Glu Trp Leu Glu Asp Ala Pro Ala 

200 205 210 

Val Gin Val Ala Glu Gly Leu Thr Leu Pro Gin Phe He Leu Arg 

215 220 225 

Asp Glu Lys Asp Leu Gly Cys Cys Thr Lys His Tyr Asn Thr Gly 

230 235 240 

Lys Phe Thr Cys He Glu Val Lys Phe His Leu Glu Arg Gin Met 

245 250 255 

Gly Tyr Tyr Leu He Gin Met Tyr He Pro Ser Leu Leu He Val 

260 265 270 

He Leu Ser Trp Val Ser Phe Trp He Asn Met Asp Ala Ala Pro 

275 280 285 

Ala Arg Val Gly Leu Gly He Thr Thr Val Leu Thr Met Thr Thr 

290 295 300 

Gin Ser Ser Gly Ser Arg Ala Ser Leu Pro Lys Val Ser Tyr Val 

305 310 315 

Lys Ala He Asp He Trp Met Ala Val Cys Leu Leu Phe Val Phe 

320 325 330 

Ala Ala Leu Leu Glu Tyr Ala Ala He Asn Phe Val Ser Arg Gin 

335 340 345 

His Lys Glu Phe He Arg Leu Arg Arg Arg Gin Arg Arg Gin Arg 

350 355 360 

Leu Glu Glu Asp He He Gin Glu Ser Arg Phe Tyr Phe Arg Gly 

365 370 375 

Tyr Gly Leu Gly His Cys Leu Gin Ala Arg Asp Gly Gly Pro Met 

380 385 390 

Glu Gly Ser Gly He Tyr Ser Pro Gin Pro Pro Ala Pro Leu Leu 

395 400 405 

Arg Glu Gly Glu Thr Thr Arg Lys Leu Tyr Val Asp 

410 415 

<210> 23 

<211> 1864 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 4672222CD1 

<400> 23 

Met Ser Gin Lys Ser Trp He Glu Ser Thr Leu Thr Lys Arg Glu 

1 5 10 15 

Cys Val Tyr He He Pro Ser Ser Lys Asp Pro His Arg Cys Leu 

20 25 30 

Pro Gly Cys Gin He Cys Gin Gin Leu Val Arg Cys Phe Cys Gly 

35 40 45 

Arg Leu Val Lys Gin His Ala Cys Phe Thr Ala Ser Leu Ala Met 

50 55 60 
Lys Tyr Ser Asp Val Lys Leu Gly Asp His Phe Asn Gin Ala He 

65 70 75 
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Glu Glu Trp Ser Val Glu Lys His Thr Glu Gin Ser Pro Thr Asp 
80 85 90 

Ala Tyr Gly Val lie Asn Phe Gin Gly Gly Ser His Ser Tyr Arg 
95 100 105 

Ala Lys Tyr Val Arg Leu Ser Tyr Asp Thr Lys Pro Glu Val He 
HO 115 120 

Leu Gin Leu Leu Leu Lys Glu Trp Gin Met Glu Leu Pro Lys Leu 
125 130 135 

Val He Ser Val His Gly Gly Met Gin Lys Phe Glu Leu His Pro 
140 145 150 

Arg He Lys Gin Leu Leu Gly Lys Gly Leu He Lys Ala Ala Val 
155 160 ^ 165 

. Thr Thr Gly Ala Trp He Leu Thr Gly Gly Val Asn Thr Gly Val 
170 175 180 

Ala Lys His Val Gly Asp Ala Leu Lys Glu His Ala Ser Arg Ser 
185 190 195 

Ser Arg Lys He Cys Thr He Gly He Ala Pro Trp Gly Val He 
200 205 210 

Glu Asn Arg Asn Asp Leu Val Gly Arg Asp Val Val Ala Pro Tyr 
215 220 225 

Gin Thr Leu Leu Asn Pro Leu Ser Lys Leu Asn Val Leu Asn Asn 
230 235 240 

Leu His Ser His Phe He Leu Val Asp Asp Gly Thr Val Gly Lys 
245 250 255 

Tvr Gly Ala Glu Val Arg Leu Arg Arg Glu Leu Glu Lys Thr He 
260 " 265 270 

Asn Gin Gin Arg He His Ala Arg He Gly Gin Gly Val Pro Val 
275 280 285 

Val Ala Leu He Phe Glu Gly Gly Pro Asn Val He Leu Thr Val 
290 295 300 

Leu Glu Tyr Leu Gin Glu Ser Pro Pro Val Pro Val Val Val Cys 
305 310 315 

Glu Gly Thr Gly Arg Ala Ala Asp Leu Leu Ala Tyr He His Lys 
320 325 330 

Gin Thr Glu Glu Gly Gly Asn Leu Pro Asp Ala Ala Glu Pro Asp 
335 " 340 345 

He He Ser Thr He Lys Lys Thr Phe Asn Phe Gly Gin Asn Glu 
350 355 360 

Ala Leu His Leu Phe Gin Thr Leu Met Glu Cys Met Lys Arg Lys 
365 370 375 

Glu Leu He Thr Val Phe His He Gly Ser Asp Glu His Gin Asp 
380 385 390 

He Asp Val Ala He Leu Thr Ala Leu Leu Lys Gly Thr Asn Ala 
395 400 405 

Ser Ala Phe Asp Gin Leu He Leu Thr Leu Ala Trp Asp Arg Val 
410 * 415 420 

Asp He Ala Lys Asn His Val Phe Val Tyr Gly Gin Gin Trp Leu 
425 430 435 

Val Gly Ser Leu Glu Gin Ala Met Leu Asp Ala Leu Val Met Asp 
440 445 450 

Arg Val Ala Phe Val Lys Leu Leu He Glu Asn Gly Val Ser Met 
455 460 465 

His Lys Phe Leu Thr He Pro Arg Leu Glu Glu Leu Tyr Asn Thr 
470 475 480 

Lvs Gin Gly Pro Thr Asn Pro Met Leu Phe His Leu Val Arg Asp 
485 490 495 

Val Lys Gin Gly Asn Leu Pro Pro Gly Tyr Lys He Thr Leu He 
500 505 510 

Asp He Gly Leu Val He Glu Tyr Leu Met Gly Gly Thr Tyr Arg 
515 520 525 

Cvs Thr Tyr Thr Arg Lys Arg Phe Arg Leu He Tyr Asn Ser Leu 
530 535 540 

Gly Gly Asn Asn Arg Arg Ser Gly Arg Asn Thr Ser Ser Ser Thr 
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545 550 555 

Pro Gin Leu Arg Lys Ser His Glu Ser Phe Gly Asn Arg Ala Asp 
560 565 570 

Lys Lys Glu Lys Met Arg His Asn His Phe lie Lys Thr Ala Gin 
575 580 585 

Pro Tyr Arg Pro Lys He Asp Thr Val Met Glu Glu Gly Lys Lys 
590 595 600 

Lys Arg Thr Lys Asp Glu He Val Asp He Asp Asp Pro Glu Thr 
605 610 615 

Lys Arg Phe Pro Tyr Pro Leu Asn Glu Leu Leu He Trp Ala Cys 
620 625 630 

Leu Met Lys Arg Gin Val Met Ala Arg Phe Leu Trp Gin His Gly 
635 640 645 

Glu Glu Ser Met Ala Lys Ala Leu Val Ala Cys Lys He Tyr Arg 
650 655 660 

Ser Met Ala Tyr Glu Ala Lys Gin Ser Asp Leu Val Asp Asp Thr 
665 670 675 

Ser Glu Glu Leu Lys Gin Tyr Ser Asn Asp Phe Gly Gin Leu Ala 
680 685 690 

Val Glu Leu Leu Glu Gin Ser Phe Arg Gin Asp Glu Thr Met Ala 
695 700 705 

Met Lys Leu Leu Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ser Thr 
710 715 720 

Cys Leu Lys Leu Ala Val Ser Ser Arg Leu Arg Pro Phe Val Ala 
725 730 735 

His Thr Cys Thr Gin Met Leu Leu Ser Asp Met Trp Met Gly Arg 
740 745 750 

Leu Asn Met Arg Lys Asn Ser Trp Tyr Lys Val He Leu Ser He 
755 760 765 

Leu Val Pro Pro Ala He Leu Leu Leu Glu Tyr Lys Thr Lys Ala 
770 775 780 

Glu Met Ser His He Pro Gin Ser Gin Asp Ala His Gin Met Thr 
785 790 795 

Met Asp Asp Ser Glu Asn Asn Phe Gin Asn He Thr Glu Glu He 
800 805 810 

Pro Met Glu Val Phe Lys Glu Val Arg He Leu Asp Ser Asn Glu 
815 820 825 

Gly Lys Asn Glu Met Glu He Gin Met Lys Ser Lys Lys Leu Pro 
830 835 840 

He Thr Arg Lys Phe Tyr Ala Phe Tyr His Ala Pro He Val Lys 
845 850 855 

Phe Trp Phe Asn Thr Leu Ala Tyr Leu Gly Phe Leu Met Leu Tyr 
860 865 870 

Thr Phe Val Val Leu Val Gin Met Glu Gin Leu Pro Ser Val Gin 
875 880 885 

Glu Trp He Val He Ala Tyr He Phe Thr Tyr Ala He Glu Lys 
890 895 900 

Val Arg Glu He Phe Met Ser Glu Ala Gly Lys Val Asn Gin Lys 
905 910 915 

He Lys Val Trp Phe Ser Asp Tyr Phe Asn He Ser Asp Thr He 
920 925 930 

Ala He He Ser Phe Phe He Gly Phe Gly Leu Arg Phe Gly Ala 
935 940 945 

Lys Trp Asn Phe Ala Asn Ala Tyr Asp Asn His Val Phe Val Ala 
950 955 960 

Gly Arg Leu lie Tyr Cys Leu Asn He He Phe Trp Tyr Val Arg 
965 970 975 

Leu Leu Asp Phe Leu Ala Val Asn Gin Gin Ala Gly Pro Tyr Val 
980 985 990 

Met Met He Gly Lys Met Val Ala Asn Met Phe Tyr He Val Val 
995 1000 1005 

He Met Ala Leu Val Leu Leu Ser Phe Gly Val Pro Arg Lys Ala 
1010 1015 1020 
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He Leu Tyr Pro His Glu Ala Pro Ser Trp Thr Leu Ala Lys Asp 
1025 1030 1035 

He Val Phe His Pro Tyr Trp Met He Phe Gly Glu Val Tyr Ala 
1040 1045 1050 

Tyr Glu He Asp Val Cys Ala Asn Asp Ser Val He Pro Gin He 
1055 1060 1065 

Cys Gly Pro Gly Thr Trp Leu Thr Pro Phe Leu Gin Ala Val Tyr 
1070 " 1075 1080 

Leu Phe Val Gin Tyr He He Met Val Asn Leu Leu He Ala Phe 
1085 1090 1095 

Phe Asn Asn Val Tyr Leu Gin Val Lys Ala He Ser Asn He Val 
1100 1105 H10 

Trp Lys Tyr Gin Arg Tyr His Phe He Met Ala Tyr His Glu Lys 
1115 1120 H25 

Pro Val Leu Pro Pro Pro Leu He He Leu Ser His He Val Ser 
1130 1135 H40 

Leu Phe Cys Cys He Cys Lys Arg Arg Lys Lys Asp Lys Thr Ser 
1145 1150 1155 

Asp Gly Pro Lys Leu Phe Leu Thr Glu Glu Asp Gin Lys Lys Leu 
1160 1165 1170 

His Asp Phe Glu Glu Gin Cys Val Glu Met Tyr Phe Asn Glu Lys 
1175 1180 1185 

Asp Asp Lys Phe His Ser Gly Ser Glu Glu Arg He Arg Val Thr 
1190 1195 1200 

Phe Glu Arg Val Glu Gin Met Cys He Gin He Lys Glu Val Gly 
1205 1210 1215 

Asp Arg Val Asn Tyr He Lys Arg Ser Leu Gin Ser Leu Asp Ser 
1220 1225 1230 

Gin He Gly His Leu Gin Asp Leu Ser Ala Leu Thr Val Asp Thr 
1235 1240 1245 

Leu Lys Thr Leu Thr Ala Gin Lys Ala Ser Glu Ala Ser Lys Val 
1250 1255 1260 

His Asn Glu He Thr Arg Glu Leu Ser He Ser Lys His Leu Ala 
1265 1270 1275 

Gin Asn Leu He Asp Asp Gly Pro Val Arg Pro Ser Val Trp Lys 
1280 1285 1290 

Lys His Gly Val Val Asn Thr Leu Ser Ser Ser Leu Pro Gin Gly 
1295 1300 1305 

Asp Leu Glu Ser Asn Asn Pro Phe His Cys Asn He Leu Met Lys 
1310 1315 1320 

Asp Asp Lys Asp Pro Gin Cys Asn He Phe Gly Gin Asp Leu Pro 
1325 1330 1335 

Ala Val Pro Gin Arg Lys Glu Phe Asn Phe Pro Glu Ala Gly Ser 
1340 1345 1350 

Ser Ser Gly Ala Leu Phe Pro Ser Ala Val Ser Pro Pro Glu Leu 
1355 1360 1365 

Arg Gin Arg Leu His Gly Val Glu Leu Leu Lys He Phe Asn Lys 
1370 1375 1380 

Asn Gin Lys Leu Gly Ser Ser Ser Thr Ser He Pro His Leu Ser 
1385 1390 1395 

Ser Pro Pro Thr Lys Phe Phe Val Ser Thr Pro Ser Gin Pro Ser 
1400 1405 1410 

Cys Lys Ser His Leu Glu Thr Gly Thr Lys Asp Gin Glu Thr Val 
1415 1420 1425 

Cys Ser Lys Ala Thr Glu Gly Asp Asn Thr Glu Phe Gly Ala Phe 
1430 1435 1440 

Val Gly His Arg Asp Ser Met Asp Leu Gin Arg Phe Lys Glu Thr 
1445 1450 1455 

Ser Asn Lys He Lys He Leu Ser Asn Asn Asn Thr Ser Glu Asn 
1460 1465 1470 

Thr Leu Lys Arg Val Ser Ser Leu Ala Gly Phe Thr Asp Cys His 
1475 1480 1485 

Arg Thr Ser He Pro Val His Ser Lys Gin Glu Lys He Ser Arg 
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1490 1495 1500 

Arg Pro Ser Thr Glu Asp Thr His Glu Val Asp Ser Lys Ala Ala 
1505 1510 xo±o 

Leu lie Pro Asp Trp Leu Gin Asp Arg Pro Ser Asn Arg Glu Met 
15 20 1525 1530 

Pro Ser Glu Glu Gly Thr Leu Asn Gly Leu Thr Ser Pro Phe Lys 
153 5 1540 1545 

Pro Ala Met Asp Thr Asn Tyr Tyr Tyr Ser Ala Val Glu Arg Asn 
1550 1555 1560 

Asn Leu Met Arg Leu Ser Gin Ser lie Pro Phe Thr Pro Val Pro 
1565 1570 1575 

Pro Arg Gly Glu Pro Val Thr Val Tyr Arg Leu Glu Glu Ser Ser 
1580 1585 ^ v 

Pro Asn He Leu Asn Asn Ser Met Ser Ser Trp Ser Gin Leu Gly 
1595 1600 1605 

Leu Cys Ala Lys lie Glu Phe Leu Ser Lys Glu Glu Met Gly Gly 
1610 1615 lb/u 

Gly Leu Arg Arg Ala Val Lys Val Gin Cys Thr Trp Ser Glu His 
1625 1630 1635 

Asp He Leu Lys Ser Gly His Leu Tyr lie He Lys Ser Phe Leu 
1640 1645 J-"" 

Pro Glu Val Val Asn Thr Trp Ser Ser He Tyr Lys Glu Asp Thr 
1655 1660 1665 

Val Leu His Leu Cys Leu Arg Glu He Gin Gin Gin Arg Ala Ala 
1670 1675 1680 

Gin Lys Leu Thr Phe Ala Phe Asn Gin Met Lys Pro Lys Ser lie 
1685 1690 1695 

Pro Tyr Ser Pro Arg Phe Leu Glu Val Phe Leu Leu Tyr Cys His 
1700 1705 1710 

Ser Ala Gly Gin Trp Phe Ala Val Glu Glu Cys Met Thr Gly Glu 
1715 1720 172b 

Phe Arg Lys Tyr Asn Asn Asn Asn Gly Asp Glu He He Pro Thr 
1730 1735 l/«u 

Asn Thr Leu Glu Glu He Met Leu Ala Phe Ser His Trp Thr Tyr 
1745 1750 1155 

Glu Tyr Thr Arg Gly Glu Leu Leu Val Leu Asp Leu Gin Gly Val 
1760 1765 I 770 

Gly Glu Asn Leu Thr Asp Pro Ser Val He Lys Ala Glu Glu Lys 
I775 1780 l' 85 

Arg Ser Cys Asp Met Val Phe Gly Pro Ala Asn Leu Gly Glu Asp 
1790 1795 1» 00 

Ala He Lys Asn Phe Arg Ala Lys His His Cys Asn Ser Cys Cys 
1805 1810 1815 

Arg Lys Leu Lys Leu Pro Asp Leu Lys^Arg Asn Asp Tyr Thrjro 

Asp Lys He IkS Pro Gin Asp Glu Pro Ser Asp Leu Asn Leu 

1835 1840 1845 

Gin Pro Gly Asn Ser Thr Lys Glu Ser Glu Ser Thr Asn Ser Val 

1850 1855 I860 

Arg Leu Met Leu 



<210> 24 
<211> 1237 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6176128CD1 

<400> 24 

Met Ala Arg Ala Lys Leu Pro Arg 



Ser Pro Ser Glu Gly Lys Ala 
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! 5 10 " 



Gly Pro Gly Gly Ala Pro Ala Gly Ala Ala Ala Pro Glu 61» Pro 
20 

His Gly Leu Ser Pro Leu Leu Pro Ala Arg Gly Gly Gly Ser Val 

35 40 
Gly Ser Asp Val Gly Gin Arg Leu Pro Val Glu Asp Phe Ser Leu 

Asp Ser Ser Leu Ser Gin Val Gin Val Glu Phe Tyr Val Asn Glu 
65 

Asn Thr Phe Lys Glu Arg Leu Lys Leu Phe Phe lie Lys Asn Gin 

80 " 
Arg Ser Ser Leu Arg He Arg Leu Phe Asn Phe Ser Leu Lys Leu 

55 100 
Leu Thr Cys Leu Leu Tyr lie Val Arg Val Leu Leu Asp Asp Pro 



110 



Ala Leu Gly He Gly Trp Trp Gly Cys Pro Arg Gin Asn Tyr Ser 



125 

Phe Asn Asp Ser Ser Ser Glu He Asn Trp Ala Pro He Leu Trp 

140 143 

Val Glu Arg Lys Met Thr Leu Trp Ala He Gin Val He Val Ala 

155 loU 
He He Ser Phe Leu Glu Thr Met Leu Leu He Tyr Leu Ser Tyr 

170 17b 
Lys Gly Asn He Trp Glu Gin He Phe Arg Val Ser Phe Val Leu 

-105 190 
Glu Met He Asn Thr Leu Pro Phe He lie Thr He Phe Trp Pro 

200 20d 
Pro Leu Arg Asn Leu Phe He Pro Val Phe Leu Asn Cys Trp Leu 

215 22U 
Ala Lys His Ala Leu Glu Asn Met He Asn Asp Phe His Arg Ala 

230 235 
lie Leu Arg Thr Gin Ser Ala Met Phe Asn Gin Val Leu He Leu 

245 250 n 

Phe Cys Thr Leu Leu Cys Leu Val Phe Thr Gly Thr Cys Gly He 



260 



265 



Gin His Leu Glu Arg Ala Gly Glu Asn Leu Ser Leu Leu Thr Ser 

275 280 
Phe Tyr Phe Cys lie Val Thr Phe Ser Thr Val Gly Tyr Gly Asp 



290 " 

Val Thr Pro Lys He Trp Pro Ser Gin Leu Leu Val Val He Met 



305 

lie Cys Val Ala Leu Val Val Leu Pro Leu Gin Phe Glu Glu Leu 

320 325 
Val Tyr Leu Trp Met Glu Arg Gin Lys Ser Gly Gly Asn Tyr Ser 

235 340 
Arg His Arg Ala Gin Thr Glu Lys His Val Val Leu Cys Val Ser 

350 3br> 
Ser Leu Lys He Asp Leu Leu Met Asp Phe Leu Asn Glu Phe Tyr 

365 370 
Ala His Pro Arg Leu Gin Asp Tyr Tyr Val Val He Leu Cys Pro 

380 385 
Thr Glu Met Asp Val Gin Val Arg Arg Val Leu Gin He Pro Leu 

395 400 
Trp Ser Gin Arg Val He Tyr Leu Gin Gly Ser Ala Leu Lys Asp 

410 415 Z 

Gin Asp Leu Met Arg Ala Lys Met Asp Asn Gly Glu Ala Cys Phe 

425 430 , 

lie Leu Ser Ser Arg Asn Glu Val Asp Arg Thr Ala Ala Asp Hxs 

440 445 
Gin Thr He Leu Arg Ala Trp Ala Val Lys Asp Phe Ala Pro Asn 

455 460 7 

Cys Pro Leu Tyr Val Gin He Leu Lys Pro Glu Asn Lys Phe His 

470 475 



310 
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Val Lys Phe Ala Asp His Val Val Cys Glu Glu Glu Cys Lys Tyr 



485 

Ala Met Leu Ala Leu Asn Cys lie Cys Pro Ala Thr Ser Thr Leu 



lie Thr Leu Leu III His Thr Ser Arg Gly Gin Glu Gly Gin Glu 



505 510 



515 

Ser Pro Glu Gin Trp Gin Arg Met Tyr Gly Arg Cys Ser Gly Asn 



520 525 



530 ^ — 

Glu Val Tyr His lie Arg Met Gly Asp Ser Lys Phe Phe Arg Glu 



535 540 



545 

Tyr Glu Gly Lys Ser Phe Thr Tyr Ala Ala Phe His Ala Hxs Lys 



550 



560 



565 570 



Lys Tyr Gly Val Cys Leu lie Gly Leu Lys Arg Glu Asp Asn Lys 



575 580 585 

Ser lie Leu Leu Asn Pro Gly Pro Arg His lie Leu Ala Ala Ser 
590 

Asp Thr Cys Phe Tyr lie Asn He Thr Lys Glu Glu Asn Ser Ala 

605 610 
Phe lie Phe Lys Gin Glu Glu Lys Arg Lys Lys Arg Ala Phe Ser 

620 o2o 
Gly Gin Gly Leu His Glu Gly Pro Ala Arg Leu Pro Val His Ser 

635 o4U u ^ 

lie lie Ala Ser Met Gly Thr Val Ala Met Asp Leu Gin Gly Thr 

650 655 
Glu His Arg Pro Thr Gin Ser Gly Gly Gly Gly Gly Gly Ser Lys 

665 67U 
Leu Ala Leu Pro Thr Glu Asn Gly Ser Gly Ser Arg Arg Pro Ser 

680 " 85 
He Ala Pro Val Leu Glu Leu Ala Asp Ser Ser Ala Leu Leu Pro 

695 lv\J 
Cys Asp Leu Leu Ser Asp Gin Ser Glu Asp Glu Val Thr Pro Ser 

710 715 
Asp Asp Glu Gly Leu Ser Val Val Glu Tyr Val Lys Gly Tyr Pro 

725 730 
Pro Asn Ser Pro Tyr He Gly Ser Ser Pro Thr Leu Cys His Leu 

740 745 , 

Leu Pro Val Lys Ala Pro Phe Cys Cys Leu Arg Leu Asp Lys Gly 



755 

Cys Lys His Asn Ser Tyr Glu Asp Ala Lys Ala Tyr Gly Phe Lys 



760 



770 

Asn Lys Leu He He Val Ser Ala Glu Thr Ala Gly Asn Gly Leu 



775 



785 



790 



Tyr Asn Phe lie Val Pro Leu Arg Ala Tyr Tyr Arg Ser Arg Lys 

800 • 

Glu Leu Asn Pro He Val Leu Leu Leu Asp Asn Lys Pro Asp Hxs 

815 8 

His Phe Leu Glu Ala He Cys Cys Phe Pro Met Val Tyr Tyr Met 

830 o-3d 
Glu Gly Ser Val Asp Asn Leu Asp Ser Leu Leu Gin Cys Gly He 

lie Tyr Ala Asp Asn Leu Val Val Val Asp Lys Glu Ser Thr Met 

860 8 " 
Ser Ala Glu Glu Asp Tyr Met Ala Asp Ala Lys Thr He Val Asn 

875 880 
Val Gin Thr Met Phe Arg Leu Phe Pro Ser Leu Ser He Thr Thr 

890 895 
Glu Leu Thr His Pro Ser Asn Met Arg Phe Met Gin Phe Arg Ala 

905 910 y -r 

Lys Asp Ser Tyr Ser Leu Ala Leu Ser Lys Leu Glu Lys Arg Glu 

920 92-> 
Arg Glu Asn Gly Ser Asn Leu Ala Phe Met Phe Arg Leu Pro Phe 

935 940 J ^ 

Ala Ala Gly Arg Val Phe Ser He Ser Met Leu Asp Thr Leu Leu 
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950 9 55 960 

Tyr Gin Ser Phe Val Lys Asp Tyr Met lie Thr lie Thr Arg Leu 
965 y /v 

Leu Leu Gly Leu Asp Thr Thr Pro Gly Ser Gly Tyr Leu Cys Ala 

980 yoD 

Met Lys lie Thr Glu Gly Asp Leu Trp^le Arg Thr Tyr Gly^g 

Leu Phe Gin Lys Leu Cys Ser Ser Ser Ala Glu lie Pro lie Gly 

1010 1015 
He Tyr Arg Thr Glu Ser His Val Phe Ser Thr Ser Glu Pro gs 

1025 IUjU 

Glu Leu Arg Ala Gin Ser Gin lie Ser Val Asn Val Glu Asp Cys 

1040 1U4D * 

Glu Asp Thr Arg Glu Val Lys Gly Pro Trp Gly Ser Arg Ala G y 
1055 lUbU 

Thr Gly Gly Ser Ser Gin Gly Arg His Thr Gly Gly Gly Asp Pro 

1070 J.U/D 
Ala Glu His Pro Leu Leu Arg Arg Lys Ser Leu Gin Trp Ala Arg 

1 085 -LUjU 
Arg Leu Ser Arg Lys Ala Pro Lys Gin Ala Gly Arg Ala Ala Ala 

1100 J-J-UD 

Ala Glu Trp He Ser Gin Gin Arg Leu Ser Leu Tyr Arg Arg Ser 

1115 JLJ.J&V 

Glu Arg Gin Glu Leu Ser Glu Leu Val Lys Asn Arg Met Lys Hi. 

1130 JLXoj 

Leu Gly Leu Pro Thr Thr Gly Tyr Glu^sp Val Ala Asn Leujhr 
Ala Ser Asp Val^et Asn Arg Val Asn Leu Gly Tyr Leu Gin Asp 

1160 1J.DD 

Glu Met Asn Asp His Gin Asn Thr Leu Ser Tyr Val Leu He Asn 

1175 lJLoU 

Pro Pro Pro Asp Thr Arg Leu Glu Proper Asp He Val Tyr^eu 



1190 — 
lie Arg Ser Asp Pro Leu Ala His Val Ala Ser Ser Ser Gin Ser 

1205 xz±u 
Arg Lys Ser Ser Cys Ser His Lys Leu Ser Ser Cys Asn Pro Glu 

1220 izzd 
Thr Arg Asp Glu Thr Gin Leu 
1235 

<210> 25 

<211> 539 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 7473418CD1 

S"°«a 5 S er Ala Leu Ser Tyr val Ser Lye Phe Lys Ser Phe Val 
lie Leu Phe V.1 Thr Pro Leu Leu Leu Leu Pro Leu Val lie Leu 



20 - - 

Met Pro Ala Lys Phe Val Arg Cys Ala Tyr Val He He Leu Met 

Ala lie Tyr Trp Cys Thr Glu Val He Pro Leu Ala Val Thr Ser 



25 



50 so 
Leu Met Pro Val Leu Leu Phe Pro Leu Phe Gin lie Leu Asp Ser 

Arg Gin Val Cys Val Gin Tyr Met Lys Asp Thr Asn Met Leu Phe 

80 

Leu Gly Gly Leu lie Val Ala Val Ala Val Glu Arg Trp Asn Leu 
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95 100 105 

His Lys Arg lie Ala Leu Arg Thr Leu Leu Trp Val Gly Ala Lys 
110 115 120 

Pro Ala Arg Leu Met Leu Gly Phe Met Gly Val Thr Ala Leu Leu 
125 130 135 

Ser Met Trp He Ser Asn Thr Ala Thr Thr Ala Met Met Val Pro 

140 145 150 

He Val Glu Ala He Leu Gin Gin Met Glu Ala Thr Ser Ala Ala 

155 160 165 

Thr Glu Ala Gly Leu Glu Leu Val Asp Lys Gly Lys Ala Lys Glu 

170 175 180 

Leu Pro Ala Asn Ser Ala Val Pro Thr Thr Gly Ser Gin Val He 

185 190 195 

Phe Glu Gly Pro Thr Leu Gly Gin Gin Glu Asp Gin Glu Arg Lys 

200 205 210 

Arg Leu Cys Lys Ala Met Thr Leu Cys He Cys Tyr Ala Ala Ser 

215 220 225 

He Gly Gly Thr Ala Thr Leu Thr Gly Thr Gly Pro Asn Val Val 

230 235 240 

Leu Leu Gly Gin Met Asn Glu Leu Phe Pro Asp Ser Lys Asp Leu 

245 250 255 

Val Asn Phe Ala Ser Trp Phe Ala Phe Ala Phe Pro Asn Met Leu 

260 265 270 

Val Met Leu Leu Phe Ala Trp Leu Trp Leu Gin Phe Val Tyr Met 

275 280 285 

Arg Phe Asn Phe Lys Lys Ser Trp Gly Cys Gly Leu Glu Ser Lys 

290 295 300 

Lys Asn Glu Lys Ala Ala Leu Lys Val Leu Gin Glu Glu Tyr Arg 

305 310 315 

Lys Leu Gly Pro Leu Ser Phe Ala Glu He Asn Val Leu He Cys 

320 325 330 

Phe Phe Leu Leu Val He Leu Trp Phe Ser Arg Asp Pro Gly Phe 

335 340 345 

Met Pro Gly Trp Leu Thr Val Ala Trp Val Glu Glu Arg Lys Thr 

350 355 360 

Pro Phe Tyr Pro Pro Pro Leu Leu Asp Trp Lys Val Thr Gin Glu 

365 370 375 

Lys Val Pro Trp Gly He Val Leu Leu Leu Gly Gly Gly Phe Ala 

380 385 390 

Leu Ala Lys Gly Ser Glu Ala Ser Gly Leu Ser Val Trp Met Gly 

395 400 405 

Lys Gin Met Glu Pro Leu His Ala Val Pro Pro Ala Ala He Thr 

410 415 420 

Leu He Leu Ser Leu Leu Val Ala Val Phe Thr Glu Cys Thr Ser 

425 430 435 

Asn Val Ala Thr Thr Thr Leu Phe Leu Pro He Phe Ala Ser Met 

440 445 450 

Ser Arg Ser He Gly Leu Asn Pro Leu Tyr He Met Leu Pro Cys 

455 460 465 

Thr Leu Ser Ala Ser Phe Ala Phe Met Leu Pro Val Ala Thr Pro 

470 475 480 

Pro Asn Ala He Val Phe Thr Tyr Gly His Leu Lys Val Ala Asp 

485 490 495 

Met Val Lys Thr Gly Val He Met Asn He He Gly Val Phe Cys 

500 505 510 

Val Phe Leu Ma Val Asn Thr Trp Gly Arg Ala He Phe Asp Leu 

515 520 525 

Asp His Phe Pro Asp Trp Ala Asn Val Thr His He Glu Thr 

530 535 

<210> 26 
<211> 755 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7474129CD1 

<400> 26 

Met Lys Ala His Pro Lys Glu Met Val Pro Leu Met Gly Lys Arg 
15 10 15 

Val Ala Ala Pro Ser Gly Asn Pro Ala Val Leu Pro Glu Lys Arg 

20 25 30 

Pro Ala Glu He Thr Pro Thr Lys Lys Ser He Ser Gly Asn Cys 

35 40 45 

Asp Asp Met Asp Ser Pro Gin Ser Pro Gin Asp Asp Val Thr Glu 

50 55 60 

Thr Pro Ser Asn Pro Asn Ser -Pro Ser Ala Gin Leu Ala Lys Glu 
65 70 75 

Glu Gin Arg Arg Lys Lys Arg Arg Leu Lys Lys Arg He Phe Ala 
80 85 90 

Ala Val Ser Glu Gly Cys Val Glu Glu Leu Val Glu Leu Leu Val 
95 100 105 

Glu Leu Gin Glu Leu Cys Arg Arg Arg His Asp Glu Asp Val Pro 

110 H5 120 

Asp Phe Leu Met His Lys Leu Thr Ala Ser Asp Thr Gly Lys Thr 

125 130 135 

Cys Leu Met Lys Ala Leu Leu Asn He Asn Pro Asn Thr Lys Glu 

140 145 150 

He Val Arg He Leu Leu Ala Phe Ala Glu Glu Asn Asp He Leu 

155 160 165 

Gly Arg Phe He Asn Ala Glu Tyr Thr Glu Glu Ala Tyr Glu Gly 

170 175 180 

Gin Thr Ala Leu Asn He Ala He Glu Arg Arg Gin Gly Asp He 

185 190 195 

Ala Ala Leu Leu He Ala Ala Gly Ala Asp Val Asn Ala His Ala 

200 205 210 

Lys Gly Ala Phe Phe Asn Pro Lys Tyr Gin His Glu Gly Phe Tyr 

215 220 225 

Phe Gly Glu Thr Pro Leu Ala Leu Ala Ala Cys Thr Asn Gin Pro 

230 235 240 

Glu He Val Gin Leu Leu Met Glu His Glu Gin Thr Asp He Thr 

245 250 255 

' Ser Arg Asp Ser Arg Gly Asn Asn He Leu His Ala Leu Val Thr 

260 265 270 

Val Ala Glu Asp Phe Lys Thr Gin Asn Asp Val Val Lys Arg Met 

275 280 285 

Tyr Asp Met He Leu Leu Arg Ser Gly Asn Trp Glu Leu Glu Thr 

290 295 300 

Thr Arg Asn Asn Asp Gly Leu Thr Pro Leu Gin Leu Ala Ala Lys 

305 310 315 

Met Gly Lys Ala Glu He Leu Lys Tyr He Leu Ser Arg Glu He 

320 325 330 

Lys Glu Lys Arg Leu Arg Ser Leu Ser Arg Lys Phe Thr Asp Trp 

335 340 345 

Ala Tyr Gly Pro Val Ser Ser Ser Leu Tyr Asp Leu Thr Asn" Val 

350 355 360 

Asp Thr Thr Thr Asp Asn Ser Val Leu Glu He Thr Val Tyr Asn 

365 370, 375 

Thr Asn He Asp Asn Arg His Glu Met Leu Thr Leu Glu Pro Leu 

380 385 390 

His Thr Leu Leu His Met Lys Trp Lys Lys Phe Ala Lys His Met 

395 400 405 

Phe Phe Leu Ser Phe Cys Phe Tyr Phe Phe Tyr Asn He Thr Leu 

.410 415 420 
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Thr Leu Val Ser Tyr Tyr Arg Pro Arg Glu Glu Glu Ala He Pro 
425 430 435 

His Pro Leu Ala Leu Thr His Lys Met Gly Trp Leu Gin Leu Leu 
440 445 450 

Gly Arg Met Phe Val Leu He Trp Ala Met Cys He Ser Val Lys 
455 460 465 

Glu Gly He Ala He Phe Leu Leu Arg Pro Ser Asp Leu Gin Ser 
470 475 480 

He Leu Ser Asp Ala Trp Phe His Phe Val Phe Phe lie Gin Ala 
485 490 495 

Val Leu Val He Leu Ser Val Phe Leu Tyr Leu Phe Ala Tyr Lys 
500 505 510 

Glu Tyr Leu Ala Cys Leu -Val Leu Ala Met Ala Leu Gly Trp Ala 
515 520 525 

Asn Met Leu Tyr Tyr Thr Arg Gly Phe Gin Ser Met Gly Met Tyr 

530 535 540 

Ser Val Met He Gin Lys Val He Leu His Asp Val Leu Lys Phe 

545 550 555 

Leu Phe Val Tyr He Val Phe Leu Leu Gly Phe Gly Val Ala Leu 

560 565 570 

Ala Ser Leu He Glu Lys Cys Pro Lys Asp Asn Lys Asp Cys Ser 

575 580 585 

Ser Tyr Gly Ser Phe Ser Asp Ala Val Leu Glu Leu Phe Lys Leu 

590 " 595 600 

Thr He Gly Leu Gly Asp Leu Asn He Gin Gin Asn Ser Lys Tyr 

605 610 615 

Pro He Leu Phe Leu Phe Leu Leu He Thr Tyr Val He Leu Thr 

620 625 630 

Phe Val Leu Leu Leu Asn Met Leu He Ala Leu Met Gly Glu Thr 

635 640 645 

Val Glu Asn Val Ser Lys Glu Ser Glu Arg He Trp Arg Leu Gin 

650 655 660 

Arg Ala Arg Thr He Leu Glu Phe Glu Lys Met Leu Pro Glu Trp 

665 670 675 

Leu Arg Ser Arg Phe Arg Met Gly Glu Leu Cys Lys Val Ala Glu 

680 685 690 

Asp Asp Phe Arg Leu Cys Leu Arg He Asn Glu Val Lys Trp Thr 

695 700 705 

Glu Trp Lys Thr His Val Ser Phe Leu Asn Glu Asp Pro Gly Pro 

710 715 720 

Val Arg Arg Thr Asp Phe Asn Lys He Gin Asp Ser Ser Arg Asn 

725 730 735 

Asn Ser Lys Thr Thr Leu Asn Ala Phe Glu Glu Val Glu Glu Phe 

740 745 750 

Pro Glu Thr Ser Val 

755 

<210> 27 

<211> 301 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7481414CD1 

<400> 27 

Met Lys Ser His Pro Ala He Gin Ala Ala He Asp Leu Thr Ala 

1 5-10 15 
Gly Ala Ala Gly Gly Gly Ala Cys Val Leu Thr Gly Gin Pro Phe 

20 25 30 

Asp Thr He Lys Val Lys Met Gin Thr Phe Pro Gin Leu Tyr Lys 

35 40 45 
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Gly Leu Ala Asp Cys Phe Leu Lys Thr Tyr Asn Gin Val Gly lie 

55 DU 



50 

Arg Gly Leu Tyr Arg Gly Thr Ser Pro Ala Leu Leu Ala Tyr Val 



Thr Gin Gly Ser Val Leu Phe Met Cys Phe Gly Phe Cys Gin Gin 



70 75 



80 



85 90 
Phe Val Arg Lys Val Ala Arg Val Glu Gin Asn Ala Glu Leu Asn 
95 100 1UD 

Asp Leu Glu Thr Ala Thr Ala Gly Ser Leu Ala Ser Ala Phe Ala 

110 115 
Ala Leu Ala Leu Cys Pro Thr Glu Leu Val Lys Cys Arg Leu Gin 
125 130 
.. Thr Met Tyr Glu Met Lys Met Ser Gly Lys He Ala Gin Ser Tyr 

140 145 , 

Asn Thr He Trp Ser Met Val Lys Ser lie Phe Met Lys Asp Gly 
155 160 X0D 

Pro Leu Gly Phe Tyr Arg Gly Leu Ser Thr Thr Leu Ala Gin Glu 
170 175 1»0 

He Pro Gly Tyr Phe Phe Tyr Phe Gly Gly Tyr Glu He Ser Arg 

185 19° 
Ser Phe Phe Ala Ser Gly Gly Ser Lys Asp Glu Leu Gly Pro Val 
200 205 210 

Pro Leu Met Leu Ser Gly Gly Phe Ala Gly He Cys Leu Trp Leu 
215 220 22S 

He He Phe Pro Val Asp Cys He Lys Ser Arg He Gin Val Leu 

230 235 
Ser Met Phe Gly Lys Pro Ala Gly Leu lie Glu Thr Phe lie Ser 

245 250 
Val Val Arg Asn Glu Gly He Ser Ala Leu Tyr Ser Gly Leu Lys 

260 265 
Ala Thr Leu He Arg Ala He Pro Ser Asn Ala Ala Leu Phe Leu 
275 280 
. Val Tyr Glu Tyr Ser Arg Lys Met Met Met Asn Met Val Glu Glu 
290 295 

Tyr 



<210> 28 

<211> 515 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7481461CD1 

Se2°Jal 8 Leu Ser Gin Glu Glu Pro Asp Ser Ala Arg Gly Thr Ser 

1 5 1° 

Glu Ala Gin Pro Leu Gly Pro Ala Pro Thr Gly Ala Ala Pro Pro 

20 " 
Pro Gly Pro Gly Pro Ser Asp Ser Pro Glu Ala Ala Val Glu Lys 

35 40 43 

Val Glu Val Glu Leu Ala Gly Pro Ala Thr Ala Glu Pro His Glu 

50 55 
Pro Pro Glu Pro Pro Glu Gly Gly Trp Gly Trp Leu Val Met Leu 

Ala Ala Met Trp Cys Asn Gly Ser Val Phe Gly He Gin Asn Ala 
80 85 90 

Cys Gly Val Leu Phe Val Ser Met Leu Glu Thr Phe Gly Ser Lys 

Asp Asp Asp Lys Met Val Phe Lys Thr Ala Trp Val Gly Ser Leu 

no 11 5 
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Ser Met Gly Met lie Phe Phe Cys Cys Pro lie Val Ser Val Phe 



Thr Asp Leu Phe Gly Cys Arg Lys Thr Ala Val Val Gly Ala Ala 



Val Gly Phe Val 111 Leu Met Ser Ser Ser Phe Val Ser Ser lie 



145 



155 

Glu Pro Leu Tyr Leu Thr Tyr Gly He lie Phe Ala Cys Gly Cys 



160 



Ser Phe Ala Tyr Gin Pro Ser Leu Val lie Leu Gly His Tyr Phe 



175 180 



185 

Lys Lys Arg Leu Gly Leu Val Asn Gly lie Val Thr Ala Gly Ser 



190 " 195 



Ser Val Phe Thr lie Leu Leu Pro Leu Leu Leu Arg Val Leu lie 



205 



215 — " ~ 

Asp Ser Val Gly Leu Phe Tyr Thr Leu Arg Val Leu Cys lie Phe 



220 



Met Phe Val Leu Phe Leu Ala Gly Phe Tta Tyr Arg Pro Leu Ala 



235 



245 

Thr Ser Thr Lys Asp Lys Glu Ser Gly Gly Ser Gly Ser Ser Leu 



250 



260 

Phe Ser Arg Lys Lys Phe Ser Pro Pro Lys Lys lie Phe Asn Phe 



265 



Ala lie Phe Lys Val Thr Ala Tyr Ala Vai Trp Ala Val Gly lie 



280 



Pro Leu Ala Leu Phe Gly Tyr Phe Val Pro Tyr Val His Leu Met 



295 



Lys His Val Asn gJu Arg Phe Gin Asp Glu Lys Asn Lys Glu Val 



310 



320 

Val Leu Met Cys He Gly Val Thr Ser Gly Val Gly Arg Leu Leu 



325 



Phe Gly Arg He Ala Asp Tyr Val Pro Gly Val Lys Lys Val Tyr 



340 



L eu Gin Val Leu Ser Phe Phe Phe lie Gly Leu Met Ser Met Met 

365 370 
lie Pro Leu Cys Ser He Phe Gly Ala Leu He Ala Val Cys Leu 

380 

lie Met Gly Leu Phe Asp Gly Cys Phe lie Ser He Met Ala Pro 



355 



lie Ala Phe Glu Leu Val Gly Ala Gin A^P Val Ser Gin Ala He 

410 41 - > 
Gly Phe Leu Leu Gly Phe Met Ser He Pro Met Thr Val Gly Pro 

425 43U 
Pro He Ala Gly Leu Leu Arg Asp Lys Leu Gly Ser Tyr Asp Val 

440 ft j _ 

Ala Phe Tyr Leu Ala Gly Val Pro Pro Leu He Gly Gly Ala Val 

Leu Cys Phe He Pro Trp He His Ser Lys Lys Gin Arg Glu He 
470 

Ser Lys Thr Thr Gly Lys Glu Lys Met Glu Lys Met Leu Glu Asn 
4-85 

Gin Asn Ser Leu Leu Ser Ser Ser Ser Gly Met Phe Lys Lys Glu 

500 505 
Ser Asp Ser He He 

515 

<210> 29 

<211> 1519 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 7472541CD1 



400 



49/85 



WO 02/12340 



PCTAJS01/24217 



<400> 29 

Met Ala Leu Ser val Asp Ser Ser Trp His Arg Trp Gin Trp Arg 
15 10 15 

Val Arg Asp Gly Phe Pro His Cys Pro Ser Glu Thr Thr Pro Leu 
20 25 30 

Leu Ser Pro Glu Lys Gly Arg Gin Ser Tyr Asn Leu Thr Gin Gin 
35 40 45 

Arg Val Val Phe Pro Asn Asn Ser He Phe His Gin Asp Trp Glu 
50 55 60 

Glu Val Ser Arg Arg Tyr Pro Gly Asn Arg Thr Cys Thr Thr Lys 
65 70 75 

Tyr Thr Leu Phe Thr Phe Leu Pro Arg Asn Leu Phe Glu Gin Phe 
80 85 90 

His Arg Trp Ala Asn Leu Tyr Phe Leu Phe Leu Val He Leu Ser 
95 100 105 

Trp Met Pro Ser Met Glu Val Phe His Arg Glu He Thr Met Leu 
110 115 120 

Pro Leu Ala He Val Leu Phe Val He Met He Lys Asp Gly Met 
125 130 135 

Glu Asp Phe Lys Arg His Arg Phe Asp Lys Ala He Asn Cys Ser 
140 145 150 

Asn He Arg He Tyr Glu Arg Lys Glu Gin Thr Tyr Val Gin Lys 
155 160 165 

Cys Trp Lys Asp Val Arg Val Gly Asp Phe He Gin Met Lys Cys 
170 175 180 

Asn Glu He Val Pro Ala Asp He Leu Leu Leu Phe Ser Ser Asp 
185 190 195 

Pro Asn Gly He Cys His Leu Glu Thr Ala Ser Leu Asp Gly Glu 
200 205 210 

Thr Asn Leu Lys Gin Arg Arg Val Val Lys Gly Phe Ser Gin Gin 
215 220 225 

Glu Val Gin Phe Glu Pro Glu Leu Phe His Asn Thr He Val Cys 
230 235 240 

Glu Lys Pro Asn Asn His Leu Asn Lys Phe Lys Gly Tyr Met Glu 
245 250 255 

His Pro Asp Gin Thr Arg Thr Gly Phe Gly Cys Glu Ser Leu Leu 
260 265 270 

Leu Arg Gly Cys Thr He Arg Asn Thr Glu Met Ala Val Gly He 
275 280 285 

Val He Tyr Ala Gly His Glu Thr Lys Ala Met Leu Asn Asn Ser 
290 295 300 

Gly Pro Arg Tyr Lys Arg Ser Lys He Glu Arg Arg Met Asn He 
305 310 315 

Asp He Phe Phe Cys He Gly He Leu He Leu Met Cys Leu He 
320 325 330 

Gly Ala Val Gly His Ser He Trp Asn Gly Thr Phe Glu Glu His 
335 340 345 

Pro Pro Phe Asp Val Pro Asp Ala Asn Gly Ser Phe Leu Pro Ser 
350 355 360 

Ala Leu Gly Gly Phe Tyr Met Phe Leu Thr Met He He Leu Leu 
365 370 375 

Gin Val Leu He Pro He Ser Leu Tyr Val Ser He Glu Leu Val 
380 385 390 

Lys Leu Gly Gin Val Phe Phe Leu Ser Asn Asp Leu Asp Leu Tyr 
395 400 405 

Asp Glu Glu Thr Asp Leu Ser He Gin Cys Arg Ala Leu Asn He 
410 415 420 

Ala Glu Asp Leu Gly Gin He Gin Tyr He Phe Ser Asp Lys Thr 
425 430 435 

Gly Thr Leu Thr Glu Asn Lys Met Val Phe Arg Arg Cys Thr He 
440 445 450 

Met Gly Ser Glu Tyr Ser His Gin Glu Asn Ala Lys Arg Leu Glu 
455 460 465 
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Thr Pro Lys Glu Leu Asp Ser Asp Gly Glu Glu Trp Thr Gin Tyr 
470 475 480 

Gin Cys Leu Ser Phe Ser Ala Arg Trp Ala Gin Asp Pro Ala Thr 
485 490 495 

Met Arg Ser Gin Lys Gly Ala Gin Pro Leu Arg Arg Ser Gin Ser 
500 505 510 

Ala Arg Val Pro He Gin Gly His Tyr Arg Gin Arg Ser Met Gly 
515 520 525 

His Arg Glu Ser Ser Gin Pro Pro Val Ala Phe Ser Ser Ser He 
530 535 540 

Glu Lys Asp Val Thr Pro Asp Lys Asn Leu Leu Thr Lys Val Arg 
545 550 555 

Asp Ala Ala Leu Trp Leu Glu Thr Leu Ser Asp Ser Arg Pro Ala 
560 565 570 

Lys Ala Ser Leu Ser Thr Thr Ser Ser He Ala Asp Phe Phe Leu 
575 580 585 

Ala Leu Thr He Cys Asn Ser Val Met Val Ser Thr Thr Thr Glu 
590 595 600 

Pro Arg- Gin Arg Trp Asp Asp Gin Lys He Val Glu Asn Asp His 
605 610 615 

Cys Gin Cys Leu Glu Phe Gin Gly Trp Arg Lys He Ser Gly Phe 
620 625 630 

Thr Tvr Cys Lys Ser Thr Phe He Phe Arg He Arg Gin Leu Gly 
635 640 645 

He He Ser Asn He Glu Ser Asn He Pro Leu Ser Phe Phe Gly 
650 655 660 

His Lys Val Thr He Lys Pro Ser Ser Lys Ala Leu Gly Thr Ser 
665 670 675 

Leu Glu Lys He Gin Gin Leu Phe Gin Lys Leu Lys Leu Leu Ser 
680 685 690 

Leu Ser Gin Ser Phe Ser Ser Thr Ala Pro Ser Asp Thr Asp Leu 
695 700 705 

Gly Glu Ser Leu Gly Ala Asn Val Ala Thr Thr Asp Ser Asp Glu 
710 715 720 

Arg Asp Asp Ala Ser Val Cys Ser Gly Gly Asp Ser Thr Asp Asp 
725 730 735 

Gly Gly Tyr Arg Ser Ser Met Trp Asp Gin Gly Asp He Leu Glu 
740 745 750 

Ser Gly Ser Gly Thr Ser Leu Glu Glu Ala Leu Glu Ala Pro Ala 
755 760 765 

Thr Asp Leu Ala Arg Pro Glu Phe Cys Tyr Glu Ala Glu Ser Pro 
770 775 780 

Asp Glu Ala Ala Leu Val His Ala Ala His Ala Tyr Ser Phe Thr 
785 790 795 

Leu Val Ser Arg Thr Pro Glu Gin Val Thr Val Arg Leu Pro Gin 
800 805 810 

Gly Thr Cys Leu Thr Phe Ser Leu Leu Cys Thr Leu Gly Phe Asp 
815 820 825 

Ser Val Arg Lys Arg Met Ser Val Val Val Arg His Pro Leu Thr 
830 835 840 

Glv Glu He Val Val Tyr Thr Lys Gly Ala Asp Ser Val He Met 
845 850 855 

Asp Leu Leu Glu Asp Pro Ala Cys Val Pro Asp He Asn Met Glu 
860 865 870 

Lys Lys Leu Arg Lys He Arg Ala Arg Thr Gin Lys His Leu Asp 
875 880 885 

Leu Tyr Ala Arg Asp Gly Leu Arg Thr Leu Cys He Ala Lys Lys 
890 895 900 

Val Val Ser Glu Glu Asp Phe Arg Arg Trp Ala Ser Phe Arg Arg 
905 910 915 

Glu Ala Glu Ala Ser Leu Asp Asn Arg Asp Glu Leu Leu Met Glu 
920 925 930 

Thr Ala Gin His Leu Glu Asn Gin Leu Thr Leu Leu Gly Ala Thr 
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935 940 945 

Gly lie Glu Asp Arg Leu Gin Glu Gly Val Pro Asp Thr lie Ala 

950 955 
Thr Leu Arg Glu Ala Gly He Gin Leu Trp Val Leu Thr Gly Asp 

965 970 
Lys Gin Glu Thr Ala Val Asn lie Ala His Ser Cys Arg Leu Leu 

Asn Gin Thr Asp Thr Val Tyr Thr lie Asn Thr Glu Asn Gin GJu 
995 1000 iuu:> 

Thr Cys Glu Ser lie Leu Asn Cys Ala Leu Glu Glu Leu Lys Gin 

1010 1015 xuziu 

Phe Arg Glu Leu Gin Lys Pro Asp Arg Lys Leu Phe Gly Phe Arg 

1025 1030 
Leu Pro Ser Lys Thr Pro Ser lie Thr Ser Glu Ala Val Val Pro 

1040 1°45 1030 

Glu Ala Gly Leu Val He Asp Gly Lys Thr Leu Asn Ala lie Phe 

1055 1060 iU " 

Gin Gly Lys Leu Glu Lys Lys Phe Leu Glu Leu Thr Gin Tyr Cys 

1070 1° 75 auou 

Arg Ser Val Leu Cys Cys Arg Ser Thr Pro Leu Gin Lys Ser Met 

1085 i°90 *y * 

lie Val Lys Leu Val Arg Asp Lys Leu Arg Val Met Thr Leu Ser 

1100 n° 5 
lie Gly Asp Gly Ala Asn Asp Val Ser_Met He Gin Ala Ala_Asp 

lie Gly He Gl/lfe Ser Gly Gin Glu Gly Met Gin Ala Val Met 

1130 H 35 , 

Ser Ser Asp Phe Ala He Thr Arg Phe Lys His Leu Lys Lys Leu 

1145 1150 
Leu Leu Val His Gly His Trp Cys Tyr Ser Arg Leu Ala Arg Met 

1160 ll 65 I" 

Val Val Tyr Tyr Leu Tyr Lys Asn Val Cys Tyr Val Asn Leu Leu 

1175 1180 1103 

Phe Trp Tyr Gin Phe Phe Cys Gly Phe Ser Ser Ser Thr Met He 

1190 1195 -l^uw 

Asp Tyr Trp Gin Met He Phe Phe Asn Leu Phe Phe Thr Ser Leu 

1205 1210 
Pro Pro Leu Val Phe Gly Val Leu Asp Lys Asp He Ser Ala Glu 

1220 1225 Z 

Thr Leu Leu Ala Leu Pro Glu Leu Tyr Lys Ser Gly Gin Asn Ser 

1235 1240 
Glu Cys Tyr Asn Leu Ser Thr Phe Trp^He Ser Met Val Asp_Ala 

Phe Tyr Gin Serpen He Cys Phe Phe He Pro Tyr Leu Ala Tyr 

1265 1270 xz/j 

Lys Gly Ser Asp He Asp Val Phe Thr Phe Gly Thr Pro He Asn 

1280 I 285 
Thr He Ser Leu Thr Thr He Leu Leu His Gin Ala Met Glu Met 

1295 1300 1JU3 

Lys Thr Trp Thr He Phe His Gly Val Val Leu Leu Gly Ser Phe 

1310 1315 J-Jisu 

Leu Met Tyr Phe Leu Val Ser Leu Leu Tyr Asn Ala Thr Cys Val 

1325 1330 
lie Cys Asn Ser Pro Thr Asn Pro Tyr Trp Val Met Glu Gly Gin 

1340 1345 1J3U 

Leu Ser Asn Pro Thr Phe Tyr Leu Val Cys Phe Leu Thr Pro Val 

1355 1360 IJ03 

Val Ala Leu Leu Pro Arg Tyr Phe Phe Leu Ser Leu Gin Gly Thr 

1370 1375 ■ L ^°' J 

Cys Gly Lys Ser Leu He Ser Lys Ala Gin Lys He Asp Lys Leu 

1385 1390 -"-J" 

Pro Pro Asp Lys Arg Asn Leu Glu He Gin Ser Trp Arg Ser Arg 

1400 1405 - L * J - U 
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Gin Arg Pro Ala Pro Val Pro Glu.ValMa Arg Pro Thr His^is 

Pro Val Ser Ser"S Thr Gly Gin Asp Phe Ser Ala Ser Thr Pro 

1430 1435 !; 

Lys Ser Ser Asn Pro Pro Lys Arg Lys His Val Glu Glu Ser Val 

1445 1450 ****** 

Leu His Glu Gin Arg Cys Gly Thr Glu Cys Met Arg Asp Asp Ser 

1460 1465 " 

Cys Ser Gly Asp Ser Ser Ala Gin Leu Ser Ser Gly Glu His Leu 

1475 1480 x "* 

Leu Gly Pro Asn Arg He Met Ala Tyr Ser Gly Gly Gin Thr Asp 

1490 1495 
Met- Cys Arg Cys Ser Lys Arg Ser Serais Arg Arg Ser Ginger 



Ser Leu Thr lie 



<210> 30 

<211> 1585 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6999183CD1 

Me°t 0 Ser 0 Lys Arg Arg Met Ser Val Gly Gin Gin Thr Trp Ala Leu 

Leu Cys Lys Asn Cys Leu Lys Lys Trp Arg Met Lys Arg Gin Thr 
20 

Leu Leu Glu Trp Leu Phe Ser Phe Leu Leu Val Leu Phe Leu Tyr 



Leu Phe Phe Ser Asn Leu His Gin Val His Asp Thr Pro Gin Met 



50 



55 



60 



Ser Ser Met Asp Leu Gly Arg Val Asp Ser Phe Asn Asp Thr Asn 



Tyr Val lie Ala Al Ala Pro Glu Ser hyl Thr Thr Gin Glu lie 
on 85 ^ 



30 

Met Asn Lys Val Ala Ser Ala Pro Phe Leu Met Ala Gly Arg Thr 



He Met Gly Trp Pro Asp Glu Lys Ser Met Asp Glu Leu Asp Leu 

110 115 
Asn Tyr Ser lie Asp Ala Val Arg Val lie Phe Thr Asp Thr Phe 

Ser Tyr His Leu Lys Phe Ser Trp Gly His Arg lie Pro Met Met 
140 i 

Lys Glu His Arg Asp His Ser Ala His Cys Gin Ala Val Asn Glu 
155 160 n 

Lys Met Lys Cys Glu Gly Ser Glu Phe. Trp Glu Lys Gly Phe Val 

Ala Phe Gin Ala Ma He Asn Ala Ala lie lie Glu lie Ala Thr 

185 190 
Asn His Ser Val Met Glu Gin Leu Met Ser Val Thr Gly Val His 

200 

Met Lys lie Leu Pro Phe Val Ala Gin Gly Gly Val Ala Thr Asp 

215 Ad&u 

Phe Phe lie Phe Phe Cys lie lie Ser Phe Ser Thr Phe lie Tyr 

230 2 " 

Tyr Val Ser Val Asn Val Thr Gin Glu Arg Gin Tyr lie Thr Ser 

245 250 ^ 

Leu Met Thr Met Met Gly Leu Arg Glu Ser Ala Phe Trp Leu Ser 

260 265 



100 
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Trp Gly Leu Met Tyr Ala Gly Phe lie Leu He Met Ala Thr Leu 

Met Ala Leu He III Lys Ser Ala Gin lie Val Val Leu Thr Gly 

290 295 
Phe Val Met Val Phe Thr Leu Phe Leu Leu Tyr Gly Leu Ser Leu 
305 310 JJ - U 

lie Thr Leu Ala Phe Leu Met Ser Val Leu He Lys Lys Pro Phe 

320 325 « i 

Leu Thr Gly Leu Val Val Phe Leu Leu lie Val Phe Trp Gly lie 

335 340 
Leu Gly Phe Pro Ala Leu Tyr Thr His Leu Pro Ala Phe Leu Glu 
350 355 



Trp Thr Leu Cys Leu Leu Ser Pro Phe Ala Phe Thr Val Gly Met 

365 370 ■ 

Ala Gin Leu lie His Leu Asp Tyr Asp Val Asn Ser Asn Ala His 



380 

Leu Asp Ser Ser Gin Asn Pro Tyr Leu lie lie Ala Thr Leu Phe 

Met Leu Val Phe As P Thr Leu Leu Tyr Leu Val Leu Thr Leu Tyr 

410 415 
Phe Asp Lys He Leu Pro Ala Glu Tyr Gly His Arg Cys Ser Pro 

425 430 n 

Leu Phe Phe Leu Lys Ser Cys Phe Trp Phe Gin His Gly Arg Ala 

440 44b 
Asn His Val Val Leu Glu Asn Glu Thr Asp Ser Asp Pro Thr Pro 

455 460 , 

Asn Asp Cys Phe Glu Pro Val Ser Pro Glu Phe Cys Gly Lys Glu 

470 475 
Ala He Arg He Lys Asn Leu Lys Lys Glu Tyr Ala Gly Lys Cys 

485 490 i 

Glu Arg Val Glu Ala Leu Lys Gly Val Val Phe Asp He Tyr Glu 

500 505 r 

Gly Gin He Thr Ala Leu Leu Gly His Ser Gly Ala Gly Lys Thr 

515 ^20 3 " 

Thr Leu Leu Asn He Leu Ser Gly Leu Ser Val Pro Thr Ser Gly 

530 535 ? 

Ser Val Thr Val Tyr Asn His Thr Leu Ser Arg Met Ala Asp lie 

545 550 ? 

Glu Asn He Ser Lys Phe Thr Gly Phe Cys Pro Gin Ser Asn Val 

560 565 J ' u 

Gin Phe Gly Phe Leu Thr Val Lys Glu Asn Leu Arg Leu Phe Ala 

575 580 003 

Lys He Lys Gly He Leu Pro His Glu Val Glu Lys Glu Val Leu 

59fr 595 
Leu Leu Asp Glu Pro Thr Ala Gly Leu Asp Pro Leu Ser Arg His 

605 .610 OJ - 

Arg He Trp Asn Leu Leu Lys Glu Gly Lys Ser Asp Arg Val He 

Leu Phe Ser Thr Gin Phe He Asp Glu Ala Asp He Leu Ala Asp 

635 640 
Arg Lys Val Phe He Ser Asn Gly Lys Leu Lys Cys Ala Gly Ser 

650 655 
Ser Leu Phe Leu Lys Lys Lys Trp Gly lie Gly Tyr His Leu Ser 

Leu His Leu Asn "u Arg Cys Asp Pro Glu Ser He Thr Ser Leu 
680 "5 o:7U 

Val Lys Gin His He Ser Asp Ala Lys Leu Thr Ala Gin Ser Glu 

Glu Lys Leu Val Tyr He Leu Pro Leu Glu Arg Thr Asn Lys Phe 

7^0 715 
Pro Glu Leu Tyr Arg Asp Leu Asp Arg Cys Ser Asn Gin Gly He 

725 73 ^ 
Glu Asp Tyr Gly Val Ser He Thr Thr Leu Asn Glu Val Phe Leu 
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740 745 750 

Lys Leu Glu Gly Lys Ser Thr lie Asp Glu Ser Asp lie Gly lie 

755 ' D ° 
Trp Gly Gin Leu Gin Thr Asp Gly Ala Lys Asp He Gly Ser Leu 

770 775 
Val Glu Leu Glu Gin Val Leu Ser Ser Phe His Glu Thr Arg Lys 



Thr lie Ser Gly Val Ala Leu Trp Arg Gin Gin Val Cys Ala lie 



Ala Lys val Arg Phe Leu Lys Leu Lys Lys Glu Arg Lys Ser Leu 



805 ~ 810 



Q15 

Trp Thr lie Leu Leu Leu Phe Gly He Ser Phe He Pro Gin Leu 



820 



830 

Leu Glu His Leu Phe Tyr Glu Ser Tyr Gin Lys Ser Tyr Pro Trp 



835 



345 ~ - ~ 

Glu Leu Ser Pro Asn Thr Tyr Phe Leu Ser Pro Gly Gin Gin Pro 



850 



860 

Gin Asp Pro Leu Thr His Leu Leu Val lie Asn Lys Thr Gly Ser 



865 



Thr lie Asp Asn Phe Leu His Ser Leu A^g Arg Gin Asn lie Ala 

Asn Gly Thr Asp Asp Pro 
910 915 
Gly Asp Glu Lys Asp His 
925 930 
Arg Leu Asn Cys Phe Pro 
940 9 45 
Leu Leu Gly He Phe Asn 
955 96C 
Ser Ser Glu His He Gin Thr Asp Arg Ser Thr Phe Phe Glu Glu 

965 ^™ 
His Met Asp Tyr Glu Tyr Gly Tyr Arg Ser Asn Thr Phe Phe Trp 

980 "5 ^ 

lie Pro Met Ala Ala Ser Phe Thr Pro^Tyr lie Ala Met Server 

lie Gly Asp Tyr Lys Lys Lys Ala His Ser Gin Leu Arg lie Ser 

1010 1015 
Gly Leu Tyr Pro Ser Ala Tyr Trp Phe Gly Gin Ala Leu Val Asp 



880 



890 



lie Glu val Asp Ala Phe Gly Thr Arg Asn Gly Thr Asp Asp Pro 



905 

Ser Tyr Asn Giy Ala He He Val Ser Gly Asp Glu Lys Asp Hxs 



920 - — 

Arg Phe Ser He Ala Cys Asn Thr Lys Arg Leu Asn Cys Phe Pro 



93S 940 945 

Val Leu Leu Asp Val He Ser Asn Gly Leu Leu Gly He Phe Asn 
950 955 - 



Leu lie Gin Ile^eu Cys Ser lie GlyT^ Val Ser Ser Leu Val 



1025 1030 1035 

Val Ser Leu Tyr Phe Leu He Leu Leu Leu Met Gin He Met Asp 

1040 1045 ^ 

Tyr He Phe Ser Pro Glu Glu He Ile^he He He Gin Asn^eu 

/ Tyr Val Ser Ser Leu Val 
1070 1075 108° 

Phe Leu Thr Tyr Val He Ser Phe He Phe Arg Asn Gly Arg Lys 

1085 1090 \_ 

Asn Ser Gly He Trp Ser Phe Phe Phe Leu He Val Val He Phe 

1100 110b 
Ser lie Val Ala Thr Asp Leu Asn Glu Tyr Gly Phe Leu Gly Leu 

1115 1120 
Phe Phe Gly Thr Met Leu He Pro Pro Phe Thr Leu He Gly Ser 

1130 1135 x 

Leu Phe He Phe Ser Glu He Ser Pro Asp Ser Met Asp Tyr Leu 

1145 1150 
Gly Ala Ser Glu Ser Glu He Val Tyr Leu Ala Leu Leu lie Pro 

1160 Hod 
Tyr Leu His Phe Leu He Phe Leu Phe^le Leu Arg Cys Leu^lu 

Met Asn Cys Arg^ys Lys Leu Met Arg Lys Asp Pro Val Phe Arg 

1190 119b 
lie Ser Pro Arg Ser Asn Ala He Phe Pro Asn Pro Glu Glu Pro 

1205 1210 
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Glu Gly Glu Glu Glu Asp lie Gin Met Glu Arg Met Arg Thr Val 
1220 1225 1230 

Asn Ala Met Ala Val Arg Asp Phe Asp Glu Thr Pro Val He He 

1235 1240 1245 

Ala Ser Cys Leu Arg Lys Glu Tyr Ala Gly Lys Lys Lys Asn Cys 

1250 1255 1260 

Phe Ser Lys Arg Lys Lys Thr He Ala Thr Arg Asn Val Ser Phe 

1265 1270 1275 

Cys Val Lys Lys Gly Glu Val He Gly Leu Leu Gly His Asn Gly 

1280 1285 1290 

Ala Gly Lys Ser Thr Thr He Lys Met He Thr Gly Asp Thr Lys 

1295 1300 1305 

Pro Thr Ala Gly Gin Val He Leu Lys Gly Ser Gly Gly Gly Glu 

1310 1315 1320 

Pro Leu Gly Phe Leu Gly Tyr Cys Pro Gin Glu Asn Ala Leu Trp 

1325 1330 1335 

Pro Asn Leu Thr Val Arg Gin His Leu Glu Val Tyr Ala Ala Val 

1340 1345 1350 

Lys Gly Leu Arg Lys Gly Asp Ala Met He Ala He Thr Arg Leu 

1355 ~ 1360 1365 

Val Asp Ala Leu Lys Leu Gin Asp Gin Leu Lys Ala Pro Val Lys 

1370 1375 1380 

Thr Leu Ser Glu Gly He Lys Arg Lys Leu Cys Phe Val Leu Ser 

1385 1390 1395 

He Leu Gly Asn Pro Ser Val Val Leu Leu Asp Glu Pro Ser Thr 

1400 1405 1410 

Gly Met Asp Pro Glu Gly Gin Gin Gin Met Trp Gin Val He Arg 

1415 1420 1425 

Ala Thr Phe Arg Asn Thr Glu Arg Gly Ala Leu Leu Thr Thr His 

1430 1435 1440 

Tyr Met Ala Glu Ala Glu Ala Val Cys Asp Arg Val Ala He Met 

1445 1450 1455 

Val Ser Gly Arg Leu Arg Cys He Gly Ser He Gin His Leu Lys 

1460 1465 1470 

Ser Lys Phe Gly Lys Asp Tyr Leu Leu Glu Met Lys Leu Lys Asn 

1475 1480 1485 

Leu Ala Gin Met Glu Pro Leu His Ala Glu He Leu Arg Leu Phe 

1490 1495 1500 

Pro Gin Ala Ala Gin Gin Glu Arg Phe Ser Ser Leu Met Val Tyr 

1505 1510 1515 

Lys Leu Pro Val Glu Asp Val Arg Pro Leu Ser Gin Ala Phe Phe 

1520 1525 1530 

Lys Leu Glu lie Val Lys Gin Ser Phe Asp Leu Glu Glu Tyr Ser 

1535 1540 1545 

Leu Ser Gin Ser Thr Leu Glu Gin Val Phe Leu Glu Leu Ser Lys 

1550 1555 1560 

Glu Gin Glu Leu Gly Asp Leu Glu Glu Asp Phe Asp Pro Ser Val 

1565 1570 1575 

Lys Trp Lys Leu Leu Leu Gin Glu Glu Pro 

1580 1585 

<210> 31 

<211> 1129 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2194064CB1 

<400> 31 

gcggccgcag ccttcgcgat aaacgggctg tcctacgggc tgctgcgctc gctgggcctt 60 
gccttccctg accttgccga gcactttgac cgaagcgccc aggacactgc gtggatcagc 120 
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gccctggccc tggccgtgca gcaggcagcc agccccgtgg ^agcgccct ^gcacgcgc 180 
tggggggccc gccccgtggg tgatggttgg ^gtcctc gctcgctgg g g^ 3QQ 

?gcgatcacc ctccacctca ccccctgtgg cgccctgctg ctacccctgg tccttcctgg 540 

SSSS SSS S S3SSS SSS SS SSS2 - 

ttacaSac ^ggctcccc acgct??aga ccggggcctg gggggatacg ^gcagcgct 720 
Sggtggcc gtggctgcga tgggggatgc gggcgcccgg ctggtctgcg ggtggctggc 780 
agaccaaggc tgggtgcccc tcccgcggct gctggccgta ttcggggctc tg ^ QQQ 
■ ggggctgtgg gtggtggggc tggtgcccgt ^gggcggc ^gagagct 6Q 
cctgctggcc gcggctgtgg cctatgggct ^gcgcgggg ^tacgccc ^tgg 102Q 
cggtgtactc cccgggctgg tgggcgtcgg aggtgtggtg ^aggccacag g^sy » Q 
gKgctgatg agcctcgggg ggctcctggg ccctcccctg tcaggtaagg acctgagctc 10 
acagatctgc ctacaactat cctctgcccc tggggttcga ggcttctaa 



<210> 32 

<211> 2699 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2744094CB1 

<400> 32 



22£L t.ttatt«t ,a.,ct r ac «™c t f= t M 

SS SS SS s= ss ss •■• 

SS. SS SS SS sab s 

tctccctccc aggttacatg gtctcttgca taattttctt ^ttgtggtg ccg J 

tcttaacgat cttcagcttc tggtggctga gctactggtt J3^QQ9C ^^|| a 600 

atagcagccg agagagcaat ggaaccatgg cagacctggg ^=^tgca gcaatcctc 6 
aactgtcctt ctaccagctg gtgtacgggc tcaacgccct sctcctcatc ^gcggggg 

tctgctcctc agggattttc accaaggtca cgaggaaggc ^ccacggcc ctgcacaaca 72 

agctcttcaa caaggttttc cgctgcccca tgagtttctt t^catc cccatc??tt 840 

ggcttttgaa ctgcttcgca ggggacttgg aacagctgga c ^gctctcg 900 

SSSS S2S2 SSSSS £g g - 

asss ssss ssss sgs B S iss 

ffiSS SEES S2SS «X -ESS SS 

ccttggctgt tgccctgttc gtggcttttg ^atttcctc ^ccccctac tcctt g 

sssss =ss ss = - «S gs 

atcrtctcaqa age tcctt ta cacatggaag gcacaagttg tccccagggg uyy » QQ 

isisJisffs^iil 

£525 SSSS SSSE SS SS SSS- ~ uj. 

ssss ssss ssss gsg H E is as 

tcatccttat egatgaagee acagcctcca ttgacatgga jca^cacc ccga g 
gcacaatccg tgaagectte cagggctgea ccgtgctcgt cattgcccac ^geca 
ctgtgctgaa ctgtgaccgc atcctggtta tgggcaatgg gaaggtggta gaatttgatc 
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5SSSS SSSSS S£3££ W g ~« 

cttctcctgg aagcaggggt aaatgtaggg 9gg9tgggga "gctggatg gaa |« 
aataggctac ttgatggctc tcaagacctt agaaccccag jaccatctaa gjcatggg 
tcagtgatca tgtggttctc cttttaactt a^ctgaa taattttata 9^ 2520 
agcttatagt tttctgatct ^gttagaag tgttgcaaat S^gtactg J 258Q 

?t"a ctct^tgc! caggftggag cgcagtggcg caatctcagc 2640 

tcactg^agc ctSgcctcc caggttcaag caattctcct gcctcagcct cctgaatag 2699 



<210> 33 
<211> 6369 
<212> DNA 
<213>.Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 2798241CB1 



: go 0 caccaca cccagctaat ttttata gt "agtagaga cagggtttca -atgttggt 60 q 
caggctggtc tcaaactcct gacttcgtga tctgcccacc ttggcctccc aaag g gg 

gattacaggc gtgagccacc 9cacccggtc agctattttc tacatgcttc y 24Q 

taatattgga ttgtatgaga ctttgggttt tgtgttaata ££acagaa g « 30Q 

tttctcttag caggctgtca accaggttag 9ttcaggtca t^ttcta °~ 36Q 

ttgaactgta gttgtcattt tagtttattt "caaaaact £tgcagtac Jtttttg^ ^ 

tgtcttgtgt gtgccttgca gtgaacagtc tggatttgga ^gtggtctg 48Q 

S2SSS SS2E =sa s l i - ~- js 

SSS !SS SS2SS 33SS » SSSS 
SEES SS3SS I— ~ f -S£2 SS 

gttactgtag tctggccctg ttaccatggg ^tgcttgca gtggggata ^999 ^ 
cagaaaagaa aaaaagattt gctatttcta "ttctccct gagca g 960 
cccattcctc aattcaaagc taaggcttct t^tggagctg jctccgcgro «L 1020 
gataccaaag gagaaaaagt accactgttg atatggtggt Jtttcaaatt <^ 1Q8Q 
ctatttcaca tgccttgttt acttttcaga ^tgacagat tgctgctcca tgca g 
ccagtttcct aagagagaca 9cttggagta tgcttaatcc ^cttacctg gg g ^ 
agctgcttat tttgccgtta aaaattacat gcagtttact 9cgtggc ttagggaggt 1260 
glttgttttt cctctttaat aggtttattc agaaaacatg tccactgcaa UWTR 132Q . 
aggagtttgg agacagacca gaacacttct actgaagaat ggttaatatt 1380 

caaaaagagt agtgttcagg aaattctttt tccactattt "tttattcc 99 UQ 
aattagcatg atgcatccaa ataagaaata tgaagaagtg ^aatatag ^ 
tatggacaag tttactottt faatctaat tcttggatat ^ctccagcg 1560 
aagcagcatc atgcagaaag tgtctactga tcatctacct 9atgt actfctgtagg ±62Q 

atatacaaat gaaaaagaaa tgttaacatc cagcctctc a » tgattccagt 1680 
tgtggttttc aaagactcca tgtcctatga acttcgtttt tttcctgata eg ^ 174Q 

atcttctatt tatatggatt caagagctgg c t9ttcaaaa "aegegagg » 1800 
ctggtcctca ggtttcacag ttttacaagc atccatagat ^gecatta tacagttg 
gaccaatgtt tctctttgga aggagctgga Qtcaactaaa S^^tacca grog 
?gctgttgta gaaatagata cctttccccg jggagtaatt ttaatatacc tjj*^ ig80 
attttcacct tttggatact ttttggcaat teatategta gcaga g 2Q40 
aaaagaattt ttaaagataa tgggacttca t^tactgee ttttggcttt ^SJW ^ 
tctatataca agtttaattt ttcttatgtc ccttcttatg 9cagtcattg B 
tttgttattt cctcaaagta gcagcattgt 9atatttctg ^ttttttcc ^tagga 
atcatctgta ttttttgett taatgetgae acctcttttt ^aaaatcaa 99 228Q 

aatagttgaa ttttttgttj ctgtggcttt tggatttatt Bjccttatg ^ 
agaaagtttt cccaaatcgt tagtgtggct "tcagtcc 9 cattttcaaa 2400 

gattggtatt 9cacaggtca tgcatttaga agattttaa ^ ttaatagtat 24 60 

22ESS ~ctg "tga tcaagtcatt ccaggggaat ttggcttacg 2520 
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gagatcatct ttatattttc tgaagccttc atattggtca -gagcaaaa gaattatga 2580 
ggagttatca gagggcaatg ttaatggaaa tattagtttt J^gaaatta "BB 21QQ 
??c?tcagaa tttgtaggaa aagaagccat aagaattagt ^tattcag g 2760 
aaagaagggt gaaaatgtgg aggctttgag aaatttgtca ^cg atattct ttg 2820 

gattactgcc ttacttggcc acagtggaac aggaaagagt acattg g tcagaaat 28 80 
tggactctgc ccaccttctg atgggtttgc atctatatat ^ atatacactt 2940 
agatgaaa?g tttgaagcaa gaaaaatgat tggcatttgt £cacagttag £J 3000 
tgatgttttg acagtagaag aaaatttatc aattttggct tcaatcaaag JJ^^ 306() 
caacaatata atacaagaag tgcagaaggt tttactagat «"»V" taggaattgc 3120 
agataaccaa gctaaaaaat taagtggtgg tcaaaaaaga aagctgtcat ^ 
tgttcttggg aacccaaaga tactgctgct agatgaacca tgacagtgtt 3240 

ttctcgacat attgtatgga atcttttaaa atacagaaaa c« tgatatcaca 3300 

cagtactcat ttcatggatg aagctgacat tcttgcagat aggaaagctg ^ 336Q 
agjaatgctg aaatgtgttg gttcttcaat ftcctcaaa agtaaatggg gg J* 3420 
ccgcctgagc atgtacatag acaaatattg tgccacagaa t^ctttctt cac gg 
acaaca?a?a cctggagcta ctttattaca acagaatgac caacaacttg gt^J^ 354() 
gcctttcaag gacatggaca aattttcagg tttgttttct gc 9 tttt aaagct 3600 
tttgggtgtc atttcttatg gtgtttccat sacgactttg gaaga g fc 3660 

agaagttgaa gcagaaattg accaagcaga ttatagtgta t«actcagc J 3720 
ggaagaaltg gattcaaaat cttttgatga aatggaacag ajcttactta tt 3780 
aaccaaggct tctctagtga gcaccatgag cctttggaaa ^aacagatg t 3840 

aaagtt?cat ttctttacct tgaaacgtga aagtaaatca ggagatcag 3g00 
gcttttaatt tttttcacag ttcagatttt gagacaaacc 3960 

?gctgtggtt cccatcaaac ttgttccaga cttatatttt ctaaaacctg 00^^ 4Q 
acataaatac aaaacaagtc tgcttcttca aaattccgcc g « gtgactatgt 4080 
tattagcttt ttcacaagcc agaacataat ^acgatg attaa g | c « atgtttt 4 140 
atccgtggct ccccatagtg cggctttaaa tgtggtgcat tcag gg 4200 
tgcagctgtt ttcaacagta ctatggttta ttctttacct atatcagug fc 4260 

taactaclat ctttatcatt taaatgtgac tgaaaccatc Jjgatctgga ^ac 
c?ttcaagaa attactgata tagtttttaa aattgagctg tattttcaag cage g ^ 
tggaatcatt gttactgcaa tgccacctta ^ttgccatg ^aaatgcag ag ^ Q 
ga?caaagct tatactcaac ttaaactttc aggtcttttg ^atetgeat f t 4500 
acaagctgtt gttgatatcc ccttattttt tatcattctt attc g g ttttttg 4 560 
attggcattt cattatggat tatattttta tactgtaaag ttcct g g fc 9 acctttaa 4620 
ccttattggt tatgttccat ^agttattct gtcacttat £9 cgtt ggcttg 4680 

gaaaatttta aataccaaag aattttggtc atttatctat cc g gg a catta 4740 

tattgeaate actgaaataa ctttctttat ^gatacaca a«g ctttcata aa 4800 

tgccttttgt atcatcattc caatctatcc acttctaggt ^^J 3 ataggctttc 4860 
gatttcttgg aagaatgtac ^aaaaaatgt gacacctat aatccatggg 4920 
agtagctgtt atategcett acctgeagtg tgtactgtgg a"t acctttcaac 4980 

SagLalaa tatggaggca gatcaataag aaaagatccc "tttcagaa acc ^ 
gaagtctaaa aataggaagc ttccagaacc accagacaat 9aggatgaag acg g ^ 
laaagctgaa agactaaagg tcaaagagct gatgggttgc ^gtgttgtg JJ^J^ 5160 
atccattatg gtcagcaatt tgcataaaga atatgatgac » aaggagagat 5220 

aagaaaagta aagaaagtgg caactaaata ^atctctttc t^gtg ttctggttgg 5 280 
cttaggacta ttgggtccaa atggtgctgg ^aaaagcaca attat « agacaagtga 5340 
tgatattgaa ccaacttcag gecaggtatt "taggagat tattcttcag aga g g- 
agatgatgat tcactgaagt ? tatgggtta ctgtcctcag ataaaccctt tggg^ 
tactacattg caggaacatt ttgaaattta Jggagctgcc a gg a atcttca g aa 5520 
catgaaagaa gtcataagtc Sfaataacaca tgcacttgat ttaaaag tatgct 5580 

gactgtaaag aaactacctg caggaatcaa acgaaagttg ^^" 9 CC aaagccaa 5640 
agggaatcct cagattactt tgctagatga accatctaca ctgetattet 5700 

aclgcacatg tggcgagcaa ttcgaactgc atttaaaaac agaaageggg ggc ^ 
gaccactcac tatatggagg aggcagaggc tgtctgtgat gaaaagg cta 5820 

?gggcagtta agatgtatcg gaacagtaca acatctaaag Jgaaatttg g ^ 5g80 
ctttttggaa attaaattga aggactggat agaaaaccta gaagc aga g 5940 
agaaattcag tatattttcc caaatgeaag ccgtcaggaa jJJJ J agc tggaaga 6000 
ttataaaatt cctaaggaag atgttcagtc c«ttcaca tggaacaggt 6060 

agctaaacat gettttgeca ttgaagaata tagcttttct ^aag c £ ttaaa0 ag 6120 

2SSSS tgggaacgS S &AS« £ ~ gj 

SSSSS SSKSS SSS3= St«SS- — «-= 
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»M WMM tetwtttt =«t, M ta, ..ctatBta t«tta...a 6360 
aaaaaaaaa 



<210> 34 

<211> 2558 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3105257CB1 

<220> 

<221> unsure 
<222> 2523 

<223> a, t, c, g, or other 



ftggggcgcg gggccggcgc tgctctgggg cgttggagcc gcgcgccgct ggajgagctg 60 q 
ctgccggggc gggggtctgg ac^cm* ^ccacgcg ggcggcgcgg g^gggcggcg 180 
gctgtgggct tgggcccggc agctgcaggg faggaggccc ggcgg a «a ggaag 2 40 

ccttcccggg acgaccagcg gctacgaccc 'tggcgcccg 9^ctcgga gg ™ 30Q 
ctcctggggc tggagtaccc tgagcgccag aswctggcag WOT™ 360 
atgtccggtg ttatctccat gtctgcccct ttctttctgg ^agatcat ^ 
tataccaacc ccactgtgga ctacagcgac aacctgaccc ^ctctgcc 48Q 
gccgtgtttc tgtgtggtgc tgccgccaat occattcgtfl £S£St gggcaggag 540 
ggtcagcgca ttgtgaatag gctgagaact tcattattct ^tccattcc g gg y ^ 
Sttgctttct ttgacaagac tcgcacagga ^attgatta ^ccgcctccc J c 660 
gcactcctgg ggcgctcagt oactguaao ctctcagatg |^caggg JJJ^ ?20 
gcttccgtag gcatcagtat gatgtttttt Q^tcaccta atctgg acggaaactg 780 
agcgtggtgc ctccagtgtc aatcattgct gtaatttatg acgtattgga 840 

accaaagtca ctcaggattc cctggcacaa S^^actcagc tagctgagga J g00 
aatgtaagaa ctgttcgagc ttttgggaaa S^atgactg aaatcgagaa * g6Q 
aaagtggacc atgtaatgca gttagcaagg aaagaggcat tcgcccggg ^ 1020 
ggagcaactg ggctctccgg aaacctgatc ^tttctg ^ctgtaca ^ c 1QBQ 
ctgatgggca gtgcccacat saccgtgggt ^actctctt ^cc« g g 1140 

tgggttggaa taagcattgg aggtctgagc tctttctact cggag g ttttaacgag 1200 
ggtgcagggg ggcgcctctg ^agctcctg WWW ° g a *? aagaa cgtg catttt 1260 
ggggtcatct taaatgagaa aagcttccag S^gcttcgg agt » cat tccgtca 1320 
gccLtccag ctcgcccaga ggtgcccata tttcaggatt tcagcctttc 1380 
ggatctgtca cggcactggt tggcccaagt ^tctggca aatcaac^ g ccgfc 144Q 
ctgctgaggt tgtacgaccc tgcttctgga actattagtc g ^ acccat tttg 1500 
cagctaaacc cagtgtggct gagatccaaa attgggacag tgag gg ctctgtgacc 156O 
ttttcttgct ctattgctga gaacattgct tatggtgctg ccggaatttc 1620 

gctgaggaaa tccagagagt ^gaagtg ^^acag ggcc tgggcagaaa 168O 

ccccaagggt tcaacactgt ggttggagaa aaggg g aaattc ttct cctagatgaa 1740 
cagcggattg cgattgcccg tgctctgcta aagaatccca • agatcgactg 1800 

gcaaccagtg cgctggatgc cgaaaatgag taccttgttc ^agaagctcc g J 1860 
atggatggaa gaacggtgtt agttattgcc «tcgtctgt ^ccattaa 192Q 
atggttgctg ttcttgacca aggaaaaatt actgaatatg ^aacatga gg ^ ^ 
tcaaaaccaa atgggatata cagaaaacta atgaacaaac cagaaaaaaa 2040 

ggaagcaatt actggtaaac aatatgagac "taatgcaa aacagtgttg * 21QQ 
actcagagac tatgaaatac ataaaccata tatcaagtta "tgaaaa fct 2160 

ttccaaagtg tgtaaaagat tgttttgaaa cgtacctgtt ctcaagatct 222Q 
agttttaata attgtaactt "taaatgtc tatagcactg J ttcttttgcc 2280 

attttctttt cttgtggaat attttaatta atatagcatg tataaataaa 2340 

tgctgttaaa gattgaagct attgtcaaat Qacaacttta aaaaggcaac 24Q0 
aagcctgatt attttaggcc agtttaccaa t«»ctgtgta "tcttccgg J 2460 
ccLctttta agtctaattt taccgatcga ataacgcgct gtgaatttt ^ 
tcggtaatct ctgaggaaac ctcttttttt acacccgcgg ayaayyy a 255g 
ccncgggttt acagggggac acggaaatgt ccctcgaa 



<210> 35 
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<211> 5065 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3200979CB1 

<400> 35 



atggttaaaa aagagataag cgtgcgtcaa caaattcagg ctcttctgta ^aagaatttt 60 
cttaaaaaat ggagaataaa aagagagttt attgggctat atttgtgcat cttttcggaa 120 
cacttcagag ctacccgttt tcctgaacaa cctcctaaag tcctgggaag cgtggatcag 180 
tSaatgac? ctggcc?ggt agtggcatat acaccagtca Staacataac -aaaggata 240 
atgaataaga tggccttggc ttcctttatg aaaggaagaa cagtcattgg gacaccagat 300 
gaagagacca tggatataga acttccaaaa aaataccatg aaatggtggg agttatattt 360 
ag?ga?actt tc?catatcg cctgaagttt aattggggat atagaatccc ^gttataaag 420 
gagcactctg aatacacaga acactgttgg gccatgcatg gtgaaatttt "gttacttg 480 
gcaaagtact ggctaaaagg gtttgtagct tttcaagctg caattaatgc tgcaattata 540 
gaagtcacaa Saatcattc tgtaatggag gagttgacat cagttattgg aataaatatg 600 
aagataccac ctttcatttc taagggagaa attatgaatg aatggtttca "ttacttgc 660 
Sagtttctt tctcttcttt tatatacttt gcatcattaa atgttgcaag S^aagagga 720 
aaa?ttaaga aactgatgac agtgatgggt ctccgagagt cagcattctg jctctcctgg 780 
ggattgacat acatttgctt catcttcatt atgtccattt ttatggctct S^taaca 840 
Ecaatcccaa ttgtatttca tactggcttc atggtgatat tcacactcta tagcttatat 900 
ggcctttctt tgatagcatt ggctttcctc atgagtgttt taataaggaa ^tatgctc 960 
gctggtttgg ctggatttct cttcactgta ttttggggat gtctgggatt cactgtgtta 1020 
?atagacaac ttStttatc tttgggatgg gtattaagtc ttcttagccc "ttgccttc 1080 
actgctggaa tggcccagat tacacacctg gataattact taagtggtgt tatttttcct 1140 
aatccc?ctg ggaattcata caaaatgata gccacttttt tcattttggc atttgatact 1200 
StttSS? ?gKattcac attatat.ttt gagcgagttt tacctgataa agatggccat 1260 
ggggattctc cattattttt ccttaagtcc tcattttggt ccaaacatca aaatactcat 1320 
cSgaaatct ttgagaatga aataaatcct gagcattcct ctgatgattc ttttgaaccg 1380 
atotctccag aattccatgg aaaagaagcc ataagaatca gaaatgttat aaaagaatat 1440 
aa?ggaaaga c?ggaaaag? agaagcattg caaggcatat tttttgacat atatgaagga 1500 
caga?cac?g caSacttgg gcataatgga gctggtaaat caacactgct aaacattctt 1560 
ag?ggattg? ctgtttctac agaaggatca gccactattt ataatactca actctctgaa 1620 
aXactgaca tggaagaaat tagaaagaat attggatttt gtccacagtt <=aattttcaa 1680 
Sgac?tcc tcactgtgag agaaaacctc agggtatttg ctaaaataaa agggattcag 1740 
ccaaaggaag tggaacaaga ggttttgctg ctagatgaac caactgctgg attggatccc 1800 
ttttcaagac accgagtgtg gagcctcctg aaggagcata aagtagaccg acttatcctc 1860 
ttcagtaccc aattcatgga tgaggctgac atcttggctg ataggaaagt ^tttc^gtct 1920 
H 9 gt tgaaatgtgc aggatcatct ttgtttctga agcgaaagtg S^ 3 ^ \ 9&0 
?atca??taa glttacacag gaatgaaatg tgtgacacag aaaaaatcac at cccttatt 2040 
aagcagcaca ?tcctgatgc caagttaaca acagaaagtg aagaaaaact tgtatatagt 2100 
??gcc?ttgg aaaaaacgaa caaatttcca gatctttaca gtgaccttga taagtgttct 2160 
gaccagggS taaggaatta tgctgtttca gtgacatctc tgaatgaagt attcttgaac 2220 
ctagaaggaa aatcagcaat tgatgaacca gattttgaca ttgggaaaca ^gaaaata 2280 
catgtgacaa gaaatactgg agatgagtct gaaatggaac aggttctttg "ctcttcct 2340 
gaaacaagaa aggctgtcag tagtgcagct ctctggagac Qacaaatcta tgcagtggca 2400 
acacttcgct tcttaaagtt aaggcgtgaa aggagagctc ttttgtgttt ^tactagta ;4bo 
cttggaattg cttttatccc catcattcta gagaagataa tgtataaagt aactcgtgaa 2520 
actcattgt? gggagttttc acccagtatg tatttccttt ctctggaaca jatcccgaag 2580 
acacctctta ccagcctgtt aatcgttaat aatacaggat caaatattga agacctcgtg 2640 
caftcSga agtgtcagga tatagttttg gaaatagatg actttagaaa cagaaatggc 2700 
tcagatgatc cctcctacaa tggagccatc atagtgtctg gtgaccagaa SSattacaga 2760 
tt??cagttg catgtaatac caagaaatcg aattgttttc ctgttcttat gggaattgtt 2820 
agcaatgccc tta?tggaat ttttaacttc acagagctta ttcaaatgga ^gcacttca 2880 
ttttttcgtg atgacatagt gctggatctt ggttttatag atgggtccaf atttttgttg 2940 
ttgatcacal actgcattlc tccttatatt ggcataagca gcatcagtga "ataaaatc 3000 
cStcctcta tcccttctat tctttgtcag aaaaatgttc aatcccagtt atggatttca 3060 
ggcctctggc cttcagcata ctggtgtgga caggctctgg tggacattcc attacacttc 3120 
??gattc?cc tttcaataca tttaatttac tacttctcat ttctgggatt ccagcttcca 3180 
tgggaactca tgtttgtttt ggtggtatgc ataattggtt gtgcagcttc tcttatattc 3240 
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ctcat g tac g tgctttcatt cat™ aagt^aa = cttttgot 3300 
tttggctttt ttattgtctt aatatgtgta tccacaattc ^ agata tgtcg 3420 

aaacccaact taattttgtg catgatattc a taccttcct ^ caatagaata 3480 

ttattgatcc agctcaactt tatgtatatg a 9 aaa jttgg ^ataccSS gagtgttatt 3540 
aatgaagtca ataaaaccat tcttttaaca aacttaatac ^taccttca g g g ^ 
ttcctttttg tcataaggtg tctggaaatg ^agtatggaa ^gaaatcac J 366Q 
ccagttttca gaatctctcc acgaagtaga ^^cata ccaatccaga 9 9 3?20 
gaagatgttc aagctgaaag agtccaagca gaaatgcac tcactactcc ^ ^ Q 

gaggaaccag tcataactgc aagctgttta cacaaggaac a a tgt taaaaaa 3840 

tgcttttcaa caacaaagaa gaaagcagcc atcagaaatg "tcgttccg | 3900 
ggtgaagttt tgggattact aggacacaat 9gagctggca aaagtacttc cat^ 9 ^ 
ataactgggt gcacagtgcc aactgcagga fftggtggtgt tacaaggca *^ 402Q 
gtaaggcaac agcgtgacaa -gcctcaag ttcttggggt -tgccct^ 
ctgtggccca agcttacaat ^aagagcac tcggag g tcttaa gctccaggaa 4140 

ggcaaagaag atgctgctct cagtatttca °J«t|JJJJ agagaaagct gtgctttgtg 4200 
caacttaagg ctcctgtgaa aactctatca ^gggaataa agaga g^ gac 4260 

ctgagcatcc tggggaaccc atcagtggtg ^tctagatg a 9 9 ccaggagagg 4320 

cccgaggggc agcagcaaat 9tggcagata ^tcaggcca ccgtgtggcc 4380 

ggcaccctct tgaccaccca ttacatgtca 9aggctaagt g a ^ aaacaagttt 4440 
atcatggtgt caggaacgct aaggtgtatt ^tccattc aac « agct ctccac 4500 
ggtaaagatt atttactaga aataaaaatg aaagaaccta ctcagguyy a fc 4560 

acagagattt tgaagctttt cccacaggct 9Cttggcagg -agatattc ctcttt^g ^ 
gcgtataagt tacctgtgga 99atgtccac ^tctatctc 9TO ct tggagcag 4680 

gcgatgaaac agaccttcaa cctggaggaa » f c f aattgataca 4740 
gtattcttag aactctgtaa agagcaggag ^gggaaatg ttgatgataa 9 acatt 48Q0 
acagttgaat ggaaacttct ^acaggaa gacccttaaa |tgaagaa ^ 4860 

caattttagg tcctactaca ttgttagttt ^cataattct acaag g acagtttt ca 4920 
ttcagttaac aaaagaaaat attcaatagt ttaaacatgc ^caatgact g 4JB0 
tttttaaaaa tttaggatga aggaaaaagg aaatataggg ^gtagca g fc 5Q40 
acaaaatcag acatgttatt atccccaaca tgggtctatt ttgtgctcag ggg 5()65 
agtttagaac gccggaccgc gtggt 



<210> 36 

<211> 1677 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 6754139CB1 



gjggaaacag gtttgagaga tactggaggg 9gcagagcag tjgggtttjg -tccctggg 60 
?gaaagtctg gactcttgtg 9cttatttgg 9cccctctag catttgtgga M0 
gactccaggt ccttgaaaag 9ggagggtgg a 9f agaaa ^ tgtc g a 24Q 
atagtaccag ttcactccat ggcctttacc tcat 9t9tcc "gcagg a * 300 
aaclagagcc acagctagag caagagaagg cagacaccag gggacactc ataagg g ^ 
ggccccagcc ctgggagtgg agggtgtgag c^aggccct |ggactaggg 999 | ^ 
acaaccctcc ttactgaccc tccagagtgc ctgggagctg a 9ggccggct gg 9 ^ 
ctgttccgtg acctctttgc caactacaca agtgccctga 9 a «tgtggc g » 54Q 
cagactctga atgtgaccct ggaggtgaca ctgtcccaga ^atcgaca gg « 600 
aaccaggtgc tgaccctgta tctgtggata cggcaggagt 9gacagatgc c 9 66Q 

tgggacccca atgcctatgg tggcctggat gccatccgca WOT g a 
cggccagaca tcgtactcta taacaaagcc 9acgcgcagc ctccaggttc cgccag^ ^ 
aacgtggtcc tgcgccacga tggcgccgtg ^gctgggacg eg cctgacgttc 840 

tcgtgccgcg tggatgtagc agccttcccg "cgacgccc ageactgegg | c ^ 
ggctcctgga ctcacggcgg gcaccaagtg 9atgtgcggc cgcg gg a 960 
ctggcggact tcgtggagaa cgtggagtgg cfcgtgctgg 9catgccggc g gg aa ^ 
gtgctcacct aeggctgetg ctccgagccc taccccgacg tcaccttca 9 9^ ^ 
cgccgccgcg ccgccgccta cgtgtgcaac c^ctgctgc «tgcgcg tc 1140 

cltgcgccgc tcgccttcca cctgcctgcc gaetcaggeg a 9aaggtgtc JJW^ 120Q 

SSSSS rS SS2SS SSSSS tgLLU caca ttCt ca 1260 
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acagcactca ccatccttat catgaacctg 
ccagcctggg ctagggccct cctgctggga 
agaggggagc cctgtgggca gtccaggcca 
gaaggagggg ctggcccccc agcgggccct 
gaagccctac tgcaccacgt agccaccatt 
cagcgctgcc atgaggactg gaagcgcctg 
atcttcttct ccatggccct ggtcatgagc 

<210> 37 

<211> 3714 

<212> DMA 

<213> Homo sapiens 



cattactgtg gtcccagtgt ccgcccagtg 1320 
cacctggcac ggggcctgtg cgtgcgggaa 1380 
cctgagttat ctcctagccc ccagtcgcct 1440 
tgccacgagc cacgatgtct gtgccgccag 1500 
gccaatacct tccgcagcca ccgagctgcc 1560 
gcccgtgtga tggaccgctt cttcctggcc 1620 
ctcctggtgc tggtgcaggc cctgtga 1677 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 6996659CB1 



aJgaggagac tgagtttgtg gtggctgctg agcagggtct gtctgctgtt sccgccgccc 60 
Jgcgcactgg tgctggccgg ggtgcccagc tcctcctcgc acccgcagcc ctgccagatc 120 
c?caaqcala tcgggcacgc ggtgagggtg ggcgcggtgc acttgcagcc ctggaccacc 180 
accccScg cggccagccg cgcEccggac gacagccgag caggagccca gagggatgag 240 
?cggagc?ag ggac?aggcg glccccggcg ccctcgccgg gcgcacgctg gttggggagc 300 
accctgcaS gccgggggcc gccgggctcc cgtaagcccg gggagggcgc cagggcggag 360 
accctKac cacgggacgc ccccctattt gccgtggaca acctgaaccg cgtggaaggg 420 
ctgcSc?ct acaacctg?c tttggaagta gtgatggcca tcgaggcagg cctgggcgat 480 
ctaccacttt tgcccttctc ctcccctagt tcgccatgga gcagtgaccc tttctccttc 54U 
ctgcaaagtg tgtgccatac cgtggtggtg caaggggtgt cggcgctgct cgccttcccc 600 
caaaaccagg gcgaaatgat ggagctcgac ttggtcagct tagtcctgca cattccagtg 660 
ScagcScg ?gcgccacga g?t£ccgcgg gagagtcaga atccccttca cctacaactg 720 
agS?agaaa a?tcattaag ttctgatgct gatgtcactg tctcaatcct ^ccatgaac 780 
aactggtaca attttagctt gttgctgtgc caggaagact ggaacatcac cgatttcctc 840 
c?cc??accc agaataattc caagttccac cttggttcta tcatcaacat caccgctaac 900 
ctcccc?cca cccaggacct cttgagcttc ctacagatcc agcttgagag tattaagaac 960 
agcacaccca cSggtgat gtttggctgc gacatggaaa gtatccggcg gattttcgaa 1020 
actacaaccc ag?t?ggggt catgccccct gaacttcgtt oggtgctggg J^tcccag 1080 
aatgtggagg aactgaggac agagggtctg cccttaggac tcattgctca tggaaaaaca liau 
acacaatcta tctttgagca ctacgtacaa gatgctatgg agctggtcgc aagagctgta 1200 
gccacJgcca ccatgScca accagaactt gctctcattc ccagcacgat gaactgcatg 1260 
gaggtggaaa ctacaaatct cacttcagga caatatttat caaggtttct ^aatacc 1320 
ac??tSgag gcctcagtgg ttccatcaga gtaaaaggtt ccaccatcgt cagctcagaa 1380 
aacaactttt tcatctggaa tcttcaacat gaccccatgg gaaagccaat gtggacccgc X44U 
Sgggcagct ggcagggggg aaagattgtc atggactatg gaatatggcc ^gagcaggcc 1500 
caglgacaca aaaccclcll ccaacatcca agtaagctac acttgagagt 99t^ccctg 1560 
aXgagcatc cttttgtctt cacaagggag gtagatgatg aaggcttgtg ^ctgctggc 1620 
caactctgtc tagaccccat gactaatgac tcttccacat tggacagcct "ttagcagc 1680 
ctccatagca gtaatgatac agtgcccatt aaattcaaga agtgctgcta tggatattgc 1740 
attaatctgc tggaaaagat agcagaagac atgaactttg acttcgacct ctatattgta 1800 
aaagatggaa ag?a?ggagc atggaaaaat gggcactgga ctgggctagt gggtgatctc 1860 
c?gSaggga cfgcccacat ggSgtcact tcctttagca tcaatactgc acggagccag 1920 
Itlllllalt ?caccagccc tttcltctcc accagcttgg gcatcttagt gaggacccga 1980 
S?acagcag ctcccaltgg agccttcatg tggccactcc actggacaat ^ggctgggg 2040 
atttttataa ctctgcacat cactgccgtc ttcctcactc tgtatgaatg gaagagtcca ^iuu 
tttggt?tS cttccaaggg gcgalataga agtaaagtct tctccttttc ttcagccttg 2160 
aacatctglt atgccctctt gtttggcaga acagtggcca tcaaacctcc aaaatgttgg 2220 
actggaa?gt ttctaatgaa cctttgggcc attttctgta tgttttgcct ttccacatac 2280 
acggcaaact tggctgctgt catggtaggt gagaagatct atgaagagct ttctggaata 2340 
ca?gacccca agttacatca tccttcccaa ggattccgct ttggaactgt ccgagaaagc 2400 
agSctgaag a?tatgtgag acaaagtttc ccagagatgc atgaatatat sa^aggtac 2460 
aatgttccag ccacccctga tggagtggag tatctgaaga atgatccaga jaaactagac 2520 
accttcatca tggacaaagc ccttctggat tatgaagtgt caatagatgc tgactgcaaa 25»o 
cttctcactg tgSgaagcc atttgccata gaaggttacg ocattgcct cccacccaac 2640 
tctccattga ccgccaacat atccgagcta atcagtcaat acaagtcaca tgggtttatg 2700 
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gatatgctcc atgacaagtg gtacagggtg Sttccctgtg gaagagaag ttttgctgtc 2760 

sks ssss sssss ssg sgs s ss 

SSSSS SSSSS SSSSS Ss ssSS iSS 

gaaaagaggt ctaatgtggg -cccgtcag cttaccgtat g-tacttc 3 
catgacaacc gacggaaata catctttagt S^cgagg y « cttgcggacc 3180 
cggatccacc aggacatccc cctccctcca aggagaagag ^tccctgc g gg 
accaatggga aagcagactc cctaaatgta tctcggaact ^ggatgca ^ 

gagctcgaga agcagattca ggtgatccgt ^aggagctgc agtggctgt gag gg JJj0 

acggagctgg aggagtatca aaggacaagt cggacttgtg ^cctaggt g J 342Q 

cttccctttc tcagttcctg accttcctct gagcccttga ^caccccgc » 3480 

tgtaactatc gacaaaggtg tggggaagct ^ggtctagg ^ttcttaaa g 354Q 
gctctccctc gcctaaagtg cagcagcagc tcctctcaag ^actctct agg a ^ 

ggtaggagtg tttttctagc aagaatctta Stcaggagta agctctgtgc g £J ■ 366Q 

tgaataacca gataacccca gctgccgtta accttttcac ^^cc* " 3714 
ctggttttta gccctttctc tgcactacca acaagagata aaattgttac tcac 



<210> 38 

<211> 1009 

<212> DNA 

<213> Homo sapiens 

<220> 

■<221> misc_feature 

<223> Incyte ID No: 7472747CB1 



cgaggaggag ggcagggcag gcggcagcct ^acaggc ccctaggtgc ttactcctca 60 

ggctcagtcc agggaccagt ttcagttgga gaagctgcgc ttcc yy a gg g tgaaag g 3 60 
cctggaccag tgggccatgg agcagtttgt g^aggtcatc ^tggaagcct gggtga gg 

-Ft ES » li S s=5= S 

aggtcaggtc ttctgtgtct tccacgccct 9^ya« af-^rtaaaaat aggaggaccg 600 
caaccacctg ggcacagggc tgcgtgccca tctggccgcc attgaaagat gggagg g ^ 
tcccaggcgc tcccaggagg tactgcaagt cctgggcctg ^tctgttcc ^ 999 ^ 
gacgctggtc attctcatct tcccacccat ^tcttcagc -tgtggagg J^W^ 

SSSSS SSSS a SS2K ctcajgtat cggagcctgg 40 

5SS5 SSSS SSSSS SSSSS = i— t- !S 

tgaccccagt gggctcccca ggcctcagaa gatccccatc tctgcatga 



<210> 39 

<211> 1155 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 7474121CB1 



S££ot cggggcaccc ccaggccagg agatgctgcc cagaggccct gggaaagctc 60 q 

ttccctggcc tctgcttcct ctgctttctg ^gacctacg ccctgg sgg J 180 

ttctctgcca ttgaggacgg ccaggtcctg f^^ao acg yy y « 240 

ttcttggagg agctctgcag aatcttgaac tgcagtgaaa cagtggtgga * 30Q 

caggatctcc aggggcatct gcagaaggtg aagcctcagt ggttta^ g 9 ggctatggc 3fi0 
tggtccttcc tgagctcgct ctttttctgc facacggtg » tctctttggt 420 
tacatctacc ccgtcaccag gcttggcaag tacttgtgca tgctctatgc tccccc gg 
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atccccctga tgttcctcgt tctcacggac 
acatcttata atcggttccg aaaattccct 
tgccccaaat ctctcttcaa gaaaaaaccg 
cagatcatca tcagtgctga agagcttcca 
ccaagctgca gcatggagct gtttgagaga 
caactgcccc cacaagccat ggagaggagt 
ctctcatact ccatcatcag caacctggat 
atccccctcc ccatcattgc ccttattgtt 
ctccccttct gggagacaca gttggatttc 
ctcaccacca ttgggtttgg ggatactgtt 
tccatttata tcatcgttgg aatggagatt 
aggctgattg acatatacaa aaatgttatg 
cttgttaaaa agtga 



acaggcgaca tcctggcaac catcttatct 480 
ttctttaccc gccccctcct ctccaagtgg 540 
gaccccaagc ccgcagatga agctgtccct 600 
ggccccaaac ttggcacatg tccttcacgc 660 
tctcatgcgc tagagaaaca gaacacactg 720 
aactcgtgtc ccgaactggt gttgggaaga 780 
gaagttggac agcaggtgga gaggttggac 840 
tttgcctaca tttcctgtgc agctgccatc 900 
gagaatgcct tctatttctg ctttgtcaca 960 
ttagaacacc ctaacttctt cctgttcttc 1020 
gtgttcattg ctttcaagtt ggtgcaaaac 1080 
ctattctttg caaaagggaa gttttaccac 1140 

1155 



<210> 40 

<211> 2733 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7475615CB1 



<400> 40 w cn 

cccttcattg age tec t tea ccagcaccag gaaggcaccg ctgatgagag cgaagatgag bo 
egaggegatg ttggtgtggg ggaggttttt gcaaatgtca atgaaggtaa agacgatgga 120 
ccctgggcct gtgtaggagg ggatggtcag tccgaagatg tacttgagca ccgaaatcag 180 
gaacaccttt cggctgcccg ctccccagac acatctgeag ccctgcccag ccggctttgc 240 
tcacccactg cttgtaaatg ccccagatat gagccagccc aggccccgct acgtggtaga 300 
cagagccgca tactccctta ccctcttcga cgatgagttt gagaagaagg aceggacata 360 
cccagtggga gagaaacttc geaatgeett cagatgttcc teagecaaga tcaaagctgt 420 • 
ggtgtttggg ctgctgcctg tgctctcctg gctccccaag tacaagatta aagactacat 480 
cattcctgac ctgctcggtg gaetcagegg gggatccatc caggtcccac aaggcatggc 540 
. atttgetctg ctggccaacc ttcctgcagt caatggcctc tactcctcct tcttccccct 600 
cctgacctac ttcttcctgg ggggtgttca ccagatggtg ccaggtacct ttgccgttat 660 
cagcatcctg gtgggtaaca tetgtctgea gctggcccca gagtcgaaat tccaggtctt 720 
caacaatgcc accaatgaga gctatgtgga cacagcagcc atggaggctg agaggctgea 780 
cgtgtcagct acgctagcct gcctcaccgc catcatccag atgggtctgg gettcatgea 840 
gtttggcttt gtggccatct acctctccga gtccttcatc eggggcttea tgacggccgc 900 
cggcctgcag atcctgattt cggtgctcaa gtacatcttc ggactgacca tcccctccta 960 
cacaggccca gggtccatcg. tctttacctt cattgacatt tgeaaaaace tcccccacac 1020 
caacatcgcc tcgctcatct tcgctctcat cagcggtgcc ttcctggtgc tggtgaagga 1080 
getcaatget cgctacatgc acaagattcg cttccccatc cctacagaga tgattgtggt 1140 
ggtggtggca acagctatct ccgggggctg taagatgece aaaaagtatc acatgeagat 1200 
cgtgggagaa atccaacgcg ggttccccac cccggtgtcg cctgtggtct cacagtggaa 1260 
ggacatgata ggcacagcct tctccctagc categtgage taegtcatea acctggctat 1320 
gggccggacc ctggccaaca ageaeggcta cgacgtggat tcgaaccagg agatgatege 1380 
tctcggctgc agcaacttct ttggctcctt ctttaaaatt catgtcattt gctgtgcgct 1440 
ttctgtcact ctggctgtgg atggagctgg aggaaaatcc cagtctgtgc taggagcect 1500- 
gatcgctgtc aatctcaaga actccctcaa gcaactcacc gacccctact acctgtggag 1560 
gaagagcaag ctggactgtt gcatctgggt agtgagcttc ctctcctcct tcttcctcag 1620 
cctgccctat ggtgtggcag tgggtgtcgc cttctccgtc ctggtcgtgg tcttccagac 1680 
tcagtttcga aatggctatg cactggccca ggtcatggac actgacattt atgtgaatcc 1740 
caagacctat aatagggece aggatatcca ggggattaaa ateatcaegt actgctcccc 1800 
tctctacttt gccaactcag agatcttcag gcaaaaggtc ategecaaga ctgtctccct 1860 
gcaggagctg cagcaggact ttgagaatgc gccccccacc gaccccaaca acaaccagac 1920 
cccggctaac ggcaccagcg tgtcctatat caccttcagc cctgacagct cctcacctgc 1980 
ccagagtgag ccaccagcct ccgctgaggc ccccggcgag cccagtgaca tgctggccag 2040 
cgtcccaccc ttcgtcacct tccacaccct catcctggac atgagtggag teagcttegt 2100 
ggacttgatg ggcatcaagg ccctggccaa gctgagctcc acctatggga agateggegt 2160 
gaaggtcttc ttggtgaaca tccatgccca ggtgtacaat gaeattagee atggaggcgt 2220 
ctttgaggat gggagtctag aatgeaagea cgtctttccc agcatacatg acgcagtcct 2280 
ctttgcccag geaaatgeta gagaegtgae cccaggacac aacttccaag gggctccagg 2340 



65/85 



PCT/US01/24217 

WO 02/12340 

~~ iiss ss m ii§ m i 

SKSS EXIST. SSSK SSSS Ki ■» 

agcagggggg tgctaccctt acaggagtga gagtctggtg agcccactct tc y ^ 
ggcctggccg caatggacaa gcctcctgct cac 

<210> 41 

<211> 3457 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7475656CB1 

Jgag^tgga gcccgcgccg tcgccggccg cgtcctccgg gcatggaagg aggcggcaag 60 q 
cccaactctt cgtctaacag ccgggacgat ggcaacagcg tcttccccgc gg a 

gcgccgggcg cggggccggc cgcggccgag aagcgcctgg jcaccccgcc ^f f f 240 

ggggccggcg cgaaggagca cggcaactcc ^gtgcttca ^tggacgg 3QQ 

gaggagccgg cggggggctt cgaagacgcc ^ggggcccc ggc ggc g a gcatg ttt 360 

cagaggcagt tcacctccat gctgcagccc WOTtcaaca jattctccct c g a ^ 

gggagccaga aggcggtgga aaaggagcag gaaagggtta ^actgcagg ^ 48Q 

atccaccctt acagtgattt caggttttac tMWJtttaa taatgcttat a g jg ^ 

ggaaatctag tcatcatacc agttggaatc ^attcttta ^gagcaaac 6Q0 

. ?ggattattt tcaatgtggc atcagataca ^tttcctat gjacctgat 660 

aggactggga ctgtcaatga agacagttct ^aatcatcc ^ggacc agtggattat 720 

atgaattatt taaaaagctg gtttgtggtt ffacttcatct ^tccatccc 9 ^ ?80 
atctttctta ttgtagaaaa aggaatggat tctgaagttt acaagacagc CK8 

SSSS 552=2 SSSS S |S S sr J- p. 

ssss SSSS SS b SSSS iSIS 
SSSS SKSSS SSSS SSSS Ua = 3 ™ s uj. 

accatgctga gcatgatcgt cggggccacc tgctotgcca tgtttgtcgg J 1260 
gctttaatcc agtctotgga ttcttcgagg cggcagtatc ^agagaagta taag ge ^ 
gaacaataca tgtcattcca taagttacca £gatatgc ^cagaagat ^ 1380 
tatgaacaca gataccaagg caaaatcttt gatgaggaaa ^attccca | 1440 
gatcctctga gagaggagat ajtcaacttc -ctgtcgga aactggtggc ^ 
ttatttgcta atgcggatcc taattccgcg acuy ^ = t-nrt^aaaaa aatgtatttc 1560 

SSSS SSSS SSSS 23553 g s i-g 
SSSS SSSS SSSS? SS SS SJS S jjj. 
SSSS SSSS SSSS S c i« ig. 

aacaatcagg agaacgaaat cctcaagcag attgtgaaac atgacaggg g g 1980 
gcaatcgctc ccatcaatta tcctcaaatg acaaccctga ^tccacatc gtc g ^ 

accccgacct cccgcatgag gacacaatct ^«ggtgt ^cagcgac g gc ^ 
cacagcaacc tgcactcccc cagtcccagc acacagaccc cccagccat a 2160 
tcac?ctgct cctacaccac cgcggtctgc agccctcctg t*«gagccc tctggc g 
cgaactttcc actatgcctc ccccaccgcc tcccagctgt ^actcatgca acag g a 
cagcagcagg tacagcagtc ccagccgccg -agactcagc cacagcagcc g^tJ 
ccacagacac ctggcagctc cacgccgaaa jatgaagtgc ^caagagc J^JJ 2400 
cacaacacca acctgacccg ggaagtcagg "actctccg cctcgg cctggcctcc 2460 
catgaggtgt ccactctgat ttccagacct "tcccactg tgggcgagtc cctgg ^ 
atccctcaac ccgtgacggc ggtccccgga ^S^ctec aaaccltccc cccgaaccga 2580 
gtcccgcagc gcgtcaccct cttccgacag a^cgtcgg ^coatccc cj ^ 

SSSSS "acgc agaaaagSca SStfott caaatttatg atccctgctg 2700 
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attgtcaaag cagaaagaaa tactctcata -ctgagact atactcagat cttattttat 2760 
tctStctcct gatagatccc tctagcctac tatgaagaga tattttagac ^gctgtgg^ ^ 

tacacgtgaa atgtaaaaat atatatacat atactataaa atatatatc 294Q 

gagagggtca aaagacctgt ttagcattca ^gttatatg tcttccttc 3QQQ 

taaaggattt aaaatgtcgt tgtaagatta t tta tttcta acctactct * Q60 

tttgalatgt atatttctct attttatgaa gagttcttgg attcaatgga aacaa g 
attttaaaaa ggcaactcaa atgaactagt aaatagcacc aatcaaaact Jtc 

agctgtgtct ctgcatctaa attgttaatc attaatggtg ggaattaaa ^ ^ 

cattttatag atctaaattg tatttcggtg ctttcaattt ^aattaggt J « 33Q0 

actacttgct tggccaccgt aggagactag cattgccact |"tgttaag J 3360 
acctcaaaca tgttcattga tctttcagaa agctgaggga aaattaatat «g 

gtgttatcgg acttttacca agactcgatc aatgttagtt gtaaataact zzz. ^ 
aaataaaaat agctattctg tgttgtaaaa aaaaaaa 



<210> 42 

<211> 5622 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 7480632CB1 



ftggacfagg agaaaaacat agggaaaaaa ccaacagaat ttgttggcat gtctacaca 60 q 
cagaccatgg cttttcagaa gccaagctga ataaaaacag ^ttaaaaga ^ ^ 
tgtagaggag tccttgaagg attcttcatt Ottttcttgg acaa g g acfc 24Q 

ccaagtgctt caaatacttc tctcttattt Jcttoactct attgct g caaat 300 

ttaccctgtt aatgaacagg acaaaatggt taaaaaagag ataagcg g J 360 
tcaggctctt ctgtacaaga attttcttaa aaaatggaga ataaaaagag g ^ ^ 
ggaatggaca ataacattgt ttctagggct atatttgtgc atcttttcgg afcga 48Q 

agctacccgt tttcctgaac aacctcctaa agtcctggga ag g taa tgaataa 540 

ctctggcctg gtagtggcat atacaccagt cagtaacata acacaaagga | 600 
gatggccttg gcttccttta tgaaaggaag aacagtcatt g^caccag j gg^ ^ 
catggatata gaacttccaa aaaaatacca tsaaatggtg ^ag agg agcactc 720 

tttctcatat cgcctgaagt ttaattgggg fatagaatc ^cagttataa gg 9 ?80 
tgaatacaca ggtcactgtt gggccatgca tggtgaaatt ttttgttact tggc ^ 
c?ggctaaaa gggtttgtag cttttcaagc tgcaattaat gtgcaatta tagaag^ ^ 
aacaaatcat tctgtaatgg aggagttgac fcagttatt Sgataaa g * 96Q 
acctttcatt tctaagggag aaattatgaa tgaatggttt ^attttactt g J 1Q20 
tttctcttct tttatatact ttgcatcatt aaatgttgca agggaaag g 9 1Q80 
gaaactgatg acagtgatgg gtctccgaga Stcagcattc tgctctcct WTO^. U4Q 
atacatttgc ttcatcttca ttatgtccat "ttatggct "ggcc atggcctt tc 1200 
aattgtattt catactggct tcatggtgat attcacactc tatagcttat 126Q 
tttggtggca ttggctttcc tcatgagtgt "taataagg aaacctatgc ^ ^ 

ggctggattt ctcttcactg tattttgggg atgtctggga ttcactgtgt tcac 138Q 
acttccttta tctttgggat gggtattaag tcttcttagc « ct gatccctc 1440 

aatggcccag attacacacc tggataatta cttaagtggt ^ttatttttc c J 15Q0 
tggggattca tacaaaatga tagccacttt tttcattttg 9°a"cga 156Q 
t??gatattc acattatatt ttgagcgagt "tacctggt aaggatggcc atgggg 
tccattattt ttccttaagt cctcattttg ^ccaaacat ^aaatactc 16QQ 
ctttgagaat gaaataaatc ctgagcattc ctctgatgat ^ctcccgaac yy » m0 
agaa?tccat ggaaaagaag ccataagaat cagaaatgtt ataaaagaat ataatgga^ ^ 
gactggaaaa gtagaagcat tgcaaggcat attttttgac atatatg^g 9 9 ggatfc 186Q 
tgcaatactt gggcataatg gagctggtaa atcaacaccg aaataactga 1920 

gtctgtttct acagaaggat cagccactat ttatutact caactctctg fc 198Q 

catggaagaa attagaaaga atattggatt "gtccacag ^ agccaaagga 2040 

cctcactgtg agagaaaacc tcagggtatt tgctaaaata aaaggg c * ttttcaag 2100 
agtggaacaa gaggtattgc tgctagatga accaactgct gg^u ^ tcttcagtac 2160 
aLccgagtg tggagcctcc tgaaggagca taaajtagac ojacttatcc tcttcaj^ ^ 
ccaattcatg gatgaggctg acatcttggc tgataggaaa 9 gatatcattt 2280 

gttgaaatgt gcaggatcat ctttgtttct gagcgaaag JWO^^J ? fcaagcagca 2340 
aagtttacac aggaatgaaa tgtgcgacac oyaaaac 
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cattcctgat g cca, g ttaa ca.=. g aaag tgaagaaaa. =«gt.tata »«t g c=ttt 2400 
ggaaaaaacg aacaaatttc cagatcttta MW« gtaSSIga aclt.gaSg 2520 

SMS SS SSg - = £• 
SSS S53S SS5 i S5 ==SS SS 

cttcttaaag ttaaggcgtg aaaggagagc ^ct.tttgtgt ttgttac g tt 2820 

tgcttttatc cccatcattc tagagaagat aatgtataaa ^ctcgtg ctct 2{J80 

ttgggagttt tcacccagta tgtatttcct "ctctggcm * tgcattcact 2940 

taccagcctg ttaatcgtta ataatacagg ^tcaaatatt ^agacctcg « 3000 
gaagtgtcag gatatagttt tggaaataga t^ctttaga ^acagaaatg gcc g ^ ^ 
?ccctcctac aatggagcca tcatagtgtc ^gaccag aaggattaca 9 3120 
tgcgtgtaat accaagaaat tgaattgttt t c ^gttctt acgggaa g * tttttt 3180 
ccttatggga atttttaact tcacggagct tattcaaatg ^gagcactt « ^ 
ttacataacc acaaaatctt ttcaaactaa ^tcccttcc tctatccctt « 3300 
tcagaaaaat gttcaatccc agttatggat ttcaggcctc tggccttcag ca gg^ 
tggacaggct ctggtggaca ttccattata cttcttgatt electee 3420 
ttactacttc atatttctgg gattccagct ttcatgggaa ctcatgtttg 348Q 
atgeataatt ggttgtgcag tttctcttat ^ttcctcaca ca g g taagta tata 3540 

252S SSS g «™ »™« r6 o„ 
SSSS SSSSS g » - « — 

sssss sss ssss i ii i ssss 5SS 

aaacttggag gaggaaccag toataactge ^gctgttta tttcctttto 3960 

aaagaaaagt tgcttttcaa caagaaagaa ^aaatagee atcagaaa g 4Q20 
tgttaaaaaa ggtgaagttt tgggattact aggacacaat ^J9«ggta J J 4080 
cattaaaatg ataactgggt gcacaaagcc aactgeagga g^gtggtgt t W 414Q 
cagagcatca gtaaggcaac agcatgacaa cagcctcaag "cttggggc « 42Q0 
ggagaactca ctgtggccca agcttacaat ^aagagcac "ggagttgt atg g^ ^ 
gaaaggactg ggcaaagaag atgetgetet cagtatttca ^tggegg * 4320 
gctccaggaa caacttaagg ctcctgtgaa aactctatca ^ggaataa agag g ^ 
gtgctttgtg ctgagcatcc tggggaaccc atcagtggtg ^^ctagatg ag 3 444Q 
egggatggae cccgaggggc agcagcaaat Qtggcagata ^tcaggc 9 4500 
caaggagagg ggcaccctct tgaccaccca ttacatgtca jaggctgagg 4560 
ccgtatggcc atgatggtgt caggaacget aaggtgcatc cccaggtgga 4620 

aafcaagttt ggtagagatt atttactaga aataaaaatg aaagaaccta cccagg^^ ^ 
agctctccac acagagattt tgaagctttt cccacaggct 9«tggcagg « 4740 
ctctttaato gcgtataagt tacctgtgga ^tgtccac cctctatctc sgg 480Q 
caagttagag gcgatgaaac agaccttcaa ^ggaggaa tacagcctct 4860 
cttggagcag gtattcttag aactctgtaa a ^gcaggag ^gggaa g a 4920 
aattgataca acagttgaat ggaaacttct cccacaggaa ^ateattS acaagaatgt 4980 
tcctaacatt caattttagg tcctactaca "gttagttt ccataattct acaag g^ 
ttccttttac ttcagttaac aaaagaaaac a "taataaa ^attcaataa tgac g^ 
tttcattttt aaaaatttag gatgaaggaa acaaggaaat atagj**** g c ? taaaaa 5 160 
aaattaacaa aatcagacat gttattcatc cccaacatgg gctattttg gctt 
taatttaaaa atcatacaat attaggttgg "ateggtta "atcaataa a 9 528Q 
gagaacattt tacaaataaa aatatgaggt "ttagcctg aa «tcaaac g £ 534Q 
Itlttaaaca ttatttactc ggattctaat "aatgtgac a "gactata aga gg g 
ataaactgat gaaatggcac agcataacat " aa ttataa tgacattctg 546Q 
aaattgcatg tgaattttag tacatattga a jttatatgg atttttatgg 55 20 

taagaaagta ccgcagttta atattttctt Jagccaactt acac g gtcagaac t 5580 
atctttttca aggtagtatc agtaggctag t^ a "tcgtt ctttcact g 
tccatgtttt tgcctggttt atgggtagtt aggttggtac ca 



<210> 43 

<211> 2600 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
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<223> Incyte ID No: 6952742CB1 



cJcotwLt ggaccccctc acgatggatt ctggatccag cagggcccag cgt ccatcca 60 
taccaggSa ggggctgggg cccgcgctgc caggagaagg cccagcacca atccccggac 120 
SS KSuoSS ccccaagggg cccgttgctg ccggggacc tgtcgtttgg 180 

51= S23 SSs SSS5 JS5RS = 

S SE =S Sis igg |si i 

SS3SS cScftSgtg gtcctLcat ggctgagcct gctgcgcggc gcc„ 1020 
S2SS SSKS S=ES SSSS ? g Wit- 1XJ0 

SEESS SSSSS SS53 ^ =S5S SSK gS 
3SSS5 SSSS S5S5 SSSSE Sgg I - 
SSSSS S5SSS SSSS SSSS - fes »S 
SSSS SSSS SSS2 SSSS SSSS ESS -20 

SSSS S?tgtgg?gg atgagcccgg ctgacgcgct ^ctgggca gcaccgtgg 1680 
ccacctgtat gctggtcagc acagaggccg TOCtgctggc tggcgtcatc °tctcgc g 
tcaacctacic cggccgcacc caaagccacg gcaccgccct gctggcccgc acc 99^ . ftfin 
cggccttSa cglggatgcc acagagttcg agggcctcgt ccctgagccc OTC^ffcww I860 
?a?tccact? tSaagggccg ctgtactatg ccaacaagga cttcttcctg cagtcactct 1920 
SgcctS gSgc?ggac gcagggtgca tggctgccag gaggaaggag BBflBBetaa ^00 
agSgggggt SSUSgga ggccctgccc agggcgagga cctgggcccg gtagcacca 2040 

i«S SS SS EES B~| |si 11 
33 iSS 53 §§ S i S E 
iEIS 5S= S 55 Sss ssss s: 

cccacagtgc tggctgggcc 



<210> 44 

<211> 2917 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> incyte ID No: 7478795CB1 



<400> 44 

gcagcgggaa gagcggagcg 
actgcacccc agacccgagc 
aacaggcctg gcctcgccct 
ccttcagcac ccgcccacct 
ctttggcctt catgagtgtg 



aggaccgcgt ccggcgcagt 
ctgctgcgcg ccccctccca 
gtggatgatg atggccttgc 
ccaaccagca ggatgcggct 
gacatctgcg tgaccacggc 



cttcaatgag cagcgcggaa 60 
gagctcacct ggtgccaggt 120 
ccccgtgagc tacaacctgg 180 
gtggaaggcg gtggtggtga 240 
catctatgtc ttcagccacc 300 
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ssss sssa sssss ssg ££I 3E S 

S3SS S== SK5 Sgg SSS SS 
SES5 SS525 55S5S c t g|gU «» 

cgalgSgag ? «c=ct ? ggagcggccg gcc.ccgcc= gHg ™0 

2SS5S ££25 SSSS «=l£==c «™ jjj 
SSS SK2 S5SS sgs 1 i | a 
5SSS S25S 255SS = £ S £E£S i~ 

cltcggacac caccatggtc agcgacctgg tctoccMJ. c«=aatgt= twetgow 
acacagtcaa ggtcacgggc gtggtggtct tcatgttcag gglaagtact 1260 

taatcacctt catgggcttc cccatcatca tgatggtgtc caacatccac „ n 

~B SSS S§ III 5S Si I: 

tgcagggagt gggggctgct gagaaggtgt tcgagttcat ^gaccggcag | 1740 
tgcacgatgg cagcttggcc cccgaccacc tggagggccg gtggacttt gag g g 
ccttcaccta ccgcactcgg ccccacaccc aggtcctgca gaacgccccc a « 
cccccggcaa gg?gacggc? ctggtggggc -tcgggcag tg^aagagc tcctgtgtca 1860 
acatcctgga gaacttctac cccctggagg mKPcmt ^gctggac ggc 9 
tcagcgccta cgaccacaag tacttgcacc gtgtgatctc cctggtgagc caggag a 
tgc?g?tcgc ccgctccatc acggataaca tctcctacgg -tgcccact ggcctttcg 2040 
agatggtggt ggaggccgca cagaaggcca atgcccacgg ^tcatcatg ga J* 
acggctacag cacagagaca ggggagaagg gcgcccagct Qtcaggtggc cagaag g 
ggg?ggccat ggcccgggct ctggtgcgga accccccagt cctcatcctg gtgaagcca 22 
ccagcgcttt ggatgccgag agcgagtatc tgatccagca OTC«tccat ggc J 

S3SS SSS SKS SSS= | ~ JS! 

SS2SS £== g=ag Sgg - »S 

SSS5S S?S 5SSSS S55S5 » ~ t -° 0 

Sggatcacc glttcctgca tcttgcccct gstccctgcc ccattcccag J^ctcctt 2700 
acccctgctg ccctgagcca acgccttcac ^cctccct ^^cctaa |^^ag 
agctgccttt ttaaacctag gtcttaccag ^ttttact gttggtttg W 288Q 
gtcaactcct agatttcaaa aacctttttc taattgggag taatggt-yyg 
aagatgttct agaaacttct gagccaggag tgaatgg 



<210> 45 

<211> 1474 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 656293CB1 



ftggggctcc ggagccacca cctcagcctg ggc™ ttctgtttct actccctgca 60 
gagtgcctgg gagctgaggg ccggctggct ctcaagctgt tccgtgacct "^g 1QQ 
Lcacaagtg ccctgagacc tgtggcagac acagaccaga ^tctgaatgt ^ccctgg g 
gtgacactgt cccagatcat cgacatggat gaacggaacc ^^gac ^tgcacccg 
?ggatacggc aggagtggac agatgcctac ctacgatggg accccaa gc ctatggtggc 300 
ctggatgcca tccgcatccc cagcagtctt gtgtggcggc caga « atggc 42 o 

aalgccgacg cgcagcctcc aggttccgcc agcaccaacg tggtcctgcg ccacgatg^ ^ 
gccgtgcgct gggacgcgcc ggccatcacg cgcagctcgt gccgcgtgga tgtag 
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ttcccgttcg acgcccagca ctgcggcctg acgttcggct cctggactca cggcgggcac 540 
caactggatg tgcggccgcg cggcgctgca gccagcctgg cggacttcgt ^aacgtg too 
gagtggcgcg tgctgggcat gccggcgcgg cggcgcgtgc tcacctacgg ctgctgctcc bbu 
raaccctacc ccgacgtcac cttcacgctg ctgctgcgcc gccgcgccgc cgcctacgtg 720 
XcaacSgc tgctgccctg cgtgctcatc tcgctgcttg cgccgctcgc cttccacctg 
cctgccgact caggcgagaa ggtgtcgctg ggcgtcaccg tgctgctggc S^caccgtc B4U 
ttccag?tgc tgctggccga gagcatgcca ccggccgaga gcgtgccgct ^^cgggaag 900 
tactacatgg ccactatgac catggtcaca ttctcaacag cactcaccat ccttatcatg Sbu 
aaccScaK actgtggtcc cagtgtccgc ccagtgccag cctgggctag ggccctcctg 1020 
Sgggaoacc Sgcacgggg cctgtgcgtg cgggaaagag gggagccctg tgggcagtcc 1080 
aaaccacctg agttatctcc tagcccccag tcgcctgaag gaggggctgg ccccccagcg 1140 
ggcccttgcc acgagccacg atgtctgtgc cgccaggaag ccctactgca ccacgtagcc 1200 
accattacca ataccttccg cagccaccga gctgcccagc gctgccatga ggactggaag 1260 
cgcctggccc gtgtgatgga ccgcttcttc ctggccatct tcttctccat ggccctggtc 1320 
a?gagcctcc ?ggtgctggt gcaggccctg tgagggctgg gactaagtca -gggatctg 1380 
ctgcagccac agctcctcca gaaagggaca gccacggcca agtggttgct ggtctttggg 
ccagccagtc tctccccact gctcctaaga tcct 

<210> 46 

<211> 1742 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 7473957CB1 

^gggctcggc tccctgcctc cgcgtcgcag cccccgccgt agccgcctcc gagcccgccg 60 
ccacaS?c tgagcagaag atggctgtgc cacccacgta tgccgatctt gg^aatctg 120 
ccaaggatgt cttcaccaag ggctatggaa tttacaagct caggctcagc caacactgag 180 
accaccaalg tgacgggca? ?ctggaaacc aagtacagat gs-ctgagta cggcctgacg 240 
tttacagaga aatggaatac cgacaataca ctaggcaccg agattactgt ggaagatcag juu 
cttgcacg?g gac?gaagct gaccttcgat tcatccttct cacctaacac tgggaaaaaa 360 
aatgc?aaaa ?caagacagg gtacaagcgg gagcacatta acctgggctg ^gacatggat 420 
??cgacattg ctgggccttc catccggggt gctctggtgc taggttacga ggg^^gctg 480 
accaactacc agatgaattt tgagactgca aaatcccgag tgacccagag caactttgca 540 
ottggctaca agaSgatga a?tccagctt cacactaatg tgaatgaogg ^cagagttt 600 
ggcggctcca Staocagaa agtgaacaag aagttggaga ccgctgtcaa tcttgcctgg 660 
acagcaggaa acagtaacac gcgcttcgga atagcagcca agtatcagat tgaccctgac 720 
ccctacttct cggctaaagt gaacaactcc agcctgatag gtttaggata cactcagact 780 
ctaaagccag g?attaaact gacactgtca gctcttctgg atggcaagaa cgtcaatgct 840 
ggtggccaca agcttggtct aggactggaa tttcaagcat aaatgaatac tgtacaattg 900 
??taatttta aactattttg cagcatagct accttcagaa tttagtgtat cttttaatgt 960 
tgtaSctg Satgcaagt attgctaaat atgttagccc tccaggttaa agttgattca 1020 
gftttaaga? g?taccct?c cagaggtaca gaagaaacct atttccaaaa aaggtccttt 1080 
cagtggtaga ctcggggaga acttggtggc ccctttgaga tgccaggttt cttttttatc 1140 
taaaaataqc tgcaagtgga agcggataat atgtaggcac tttgtaaatt catattgagt 1200 
aaa?gaa?ga aaS?gl?t tcc?gagaat cgaaccttgg ttccctaacc ctaattgatg 1260 
aaaoqctcgc ?gct?gatgg tgtgtacaaa ctcacctgaa tgggactttt ttagacagat 1320 
SStgacc tgttSacc ccagttcatc atcatctctt ttacaccaaa aggtctgcag 1380 
aXtllttt Sgtttcttt tgtgccattt tggggtggag aaggtggatg tgatgaagcc 1440 
aaSScag gacttattcc ttcttgtgtt gtgttttttt ttggcccttg caccagagta 1500 
tgaaatagc? ?ccaggagct ccagctataa gcttggaagt gtctgtgtga ttgtaatcac 1560 
a?ggtgacaa cactcagaat ctaaattgga cttctgttgt attctcacca ctcaatttgt 1620 
?tt?tagcag ttaatgggta cattttagag tctccatttt gttggaatta gatcctccct 1680 
Jcaagggctg gattacacac ttaaaaactg attaatattg acttaaaaaa aagggccgca 1740 



ta 

<210> 47 

<211> 2312 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feat-ure 

<223> Incyte ID No: 7474111CB1 



^aagagaaa ggctgttttt gccggtgaca aggcgttcct ccccagactt t-toojjt 60 
ctaagggcag catctcgaga gggtccaggc ^gctacacg "aggcgcct tctc gg 
cgcgccccag aacgtggggg ccggggccgg Qtcggggacc J^tccctcc gcggg gg 
aggcggcacc gaggctgggg gatggggcgt ^cgccc t^gagcacg eg | |g 
gtcgccccgc gcggcggtag gaagagcaaa ^cggcagga J^gccgtgg ccggg g ^ 
ttccctgaga aatccttagg ggcgatgtca gaacccgagc ^ggctccgg ac g a 
gaaaaagctc tccaaacgcc gtctgtcccc ^acccgagt ccacactggg ^ttga ^ 
gggctgttaa aaggggccct ggggactgcc caattcatcc c ^tggccca ggg ay a 
cgggagcagg catcccggcg ctgggctccc cgctcccccg ccctgcggac ccctccccgc 54 q 

SSS5 2SSS W S2SS » SS5S « 
S35SS 3352 =S = g-Si s-ss - 
SSSS SSSE SSSSS | «j* s ~ j* 
32SSS S533S »S 5SSS i«. 

SS sss~ sss ssss ™ 

SSSSS 3S2SS * = 2 « ~ S 

acgccgtatt acatctctgt gttgatgaca gtgtttacag ^gagaactc tcaaccccag 

HS Si Si | i I I Is lili 

S5SS S55SS 2S5K 25SS | « f ™ 
SS 5SSSS 2SSS |1« -SSK S£S= 3S 
ESSE 2SSSS SSSS SSSS ™, gssg 

sssss sssss sssss sm sli = ss 
sss s a = Si S sssss ss 

ctattttcca aatgtttttc catcttatga ^ttcagaaga age ay atcaat gtca 2280 
ttttgtttga gagtaacatt ttcatttcta aatgttttat aatttctcac ac y 231J 
gaagtatcct ggaaacatat gtcacatgcg ag 



<210> 48 

<211> 2320 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7480826CB1 

<400> 48 

cccgccgctc geagggctge tccacagccg 
agtactgeta gcagcagccc gaccacgcgt 
ggcggacgcc ggaggattgg ggeggcaatt 
tcctgcctgt tgttggattt ggggaaattt 
tgagagaaac ccactgaaga cgtctgcgtg 
gccgctgcac ccaccgccaa ggacaaaagg 
tcccagtgct tagcatgaag aaggccgaaa 
acagcagcag ctacagttcc aacagegact 



cgcgacgccg ccgccttaga acgcctttcc 60- 
taccgcacgc tcgcgccttt cccttgacac 120 
tgtcttttcc ttttttatta aaattatttt 180 
tttgtttgtt ttttatgatt tgtatttgac 240 
agaatagaga ccaccgaggc cgactcgcgg 300 
agcccagcgc tactagctgc acccgattcc 360 
tgggacgatt cagtatttcc ccggatgaag 420 
tcaactactc ctaccccacc aagcaagctg 480 
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ctctgaaaag -ttjtg =— « gaacttt". ct = a M. 

sssss sssss S sssss sssss ssssss ss 

SSSSSS SSSSSSS SSSSSSS SSSSS SSSlStta ttatatgaae 780 

™s sssss sssss « SHE siisi 

tgatccaggc attaacgaac attgaagata aaactggatt gggtatctg JJJJJ 1Q20 

HIS EsBSss I I ill ESSE E 
Si HE sSS ||| his S 

ESS SSSSSS SSSSSS | a ujj 

sssss sssss sssss sssss f I 3s as 

SSSSS sssss sssss sssss ssssl a f j s 

SSSSS SSSSSS SSSSS SSSSS SS S SSS£ .soo 

tgttcttcct gttaagtggt gtactggtga tgaccggaag ^atggccttg ^tgttttgg i» 
Stgggtaca caatgcacct ggaggtggcc attaattggc accactcaaa ^^tca 1920 

SSSSS 5S SSS »gl SEI S 

aacaacaaaa ccacgagtct cgggttaagg gaagtgacaa "ttattcca "ccag g 

taaacaaact cttaactttt atcaagccac atgcttggct gtgtcattgt ttaa "^ga zzz 

ta?SSS tttScttga atgtgcctaa tggaaccatt tgatgtgaga aacaattctt 2280 

tttaatttac agcaaaatat tgaataacca ttgacaaaaa 



<210> 49 
<211> 1781 
,<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 6025572CB1 

<400> 49 



SSctSct cgcc.gt.cg .tgctgcagc ggttttccgg « t tccgctt ccc.tcatcg M 

sssss sssss sssss s s s s ~. xj. 

SSSSS SSSSS SSSSS SS SS SS £= — ? 300 
SSSSS 2552 SS SS HsSS SSSSS S! 

ssisl s s safe sssss sssss g 

SSSSS SSSSS? tagStgactg t.ttatgaa. atagca^aat -gatggaat 660 
Igclggttt. taccaagggt ttggtgtttc agtacagggc atcattgtgt accgagoctc fl 

SSSSS SSSSS SSSSS SSSSSS SSSSSS SSS «. 

SSSSSSS SSSSaagal gtatgatgat gcagagtggt ==«ctaaao g^aatataa 900 

sssss SSSSS SSSSS SSSSSS? sssss s 

SSIaattaaa gaattctttc atattgatat tggtggtagg taatcgggag ag»aattaa 1080 

sssss sssss ssssss ss s ss s g tr ^ .oo 

tttcaagaat ttaaatacta aaaatcagat aaatgtggat tttcctccca ctcaga c . 
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aacacatttt - agtgtgatat ttcatttatt 
aaattctttt tatgattaaa aattaatcat 
aatgaaagta gcgtctttta aattgctatt 
aagtcatatg gtatgacata tttcttaaaa 
gcagaaataa gctttgttct atatctcttc 
ttacagtatc agcctttgat tatagatgtg 
acattataaa actgaaactg gaaaataaaa 
taaataaaat cttgctagtg tgaatatatc 
gtttgtatat tttgttgaca ataaaggaag 

<210> 50 

<211> 2433 

<212> DNA 

<213> Homo sapiens 



ataggtagta tattttaatt tgttagttta 1320 
ataatcctag attaatgctg aaatctagga 1380 
catttaatat acctgttttc ccatcttttg 1440 
gcttatcaat agatgtcatc atatgtgtag 1500 
taagacagtt gttattactg tgtataatat 1560 
atcatttaaa atttgataat gactttagtg 1620 
tggcttatct gctgatgttt atctttaaaa 1680 
ttagaacaaa aggtatcctc ttgaaaatta 1740 
cttaactgtt a 1781 



<220> 

<221> miscjEeature 

<223> Incyte ID No: 5686561CB1 



ca?ccqc?ca caatgccaca tcaatgatac gagcacgtag cctcactgct tgcacagtgc 60 
a?gacagag? cggc?gcgag caggcgaggt ggcctgaggg aggtcactag gctggctgag 120 
.StStS SSStStS ag?cggcctg cttccaggca ccgtgtccat ^gggtaagc 180 
ggtctccctg ggtgcccact cttgcgcccg gagatcctga gtttggtcct gtctggccat zzv 
gaagctcagc SgStgggag gccacaggga gatgcaggct ^cggcggg tggatggttc 300 
caaccaatta qgtccggggc ctggagctca gcctgtgggg tggggaccca gtggtgccct jou 
ggagc?gccg ??tc?Jc?ct cag?aggatg atgggcagga cagggagagg ctgacctact 420 
llclgalocl gcctgagtct ctgacttccc tcctggtgct ^tgaccacg JJJ 
ccgatgtgat gattcctgcg tattccaaga accgggccta tgccatcttc "catagtct 540 
tcacStgat aggaagcctg tttctgatga acctgctgac agccatcatc tacagtcagt 600 
tccaaaacta cctgatgaaa tctctccaga cctcgctgtt tcggaggcgg ctgggaaccc 660 
gggc?gcct? tgaStccta tcctccatgg tgggggaggg aggagccttc cctcaggcag 720 
ttoaaataaa occccagaac ttgctgcagg tgcttcagaa ggtccagctg gacagctccc 780 
acalacaSc caSSJag aaggtgcgl? cctacgacag tgttctgctg tcagctgagg 840 
ag?t?caSa gcStSa? gagcttgaca gaagtgtggt ^agagcac ecgcc*^ 900 
ccgagtacca gtctccgttt ctgcagagcg cccagttcct cttcggccac tactactttg ybu 
ac?a?2ggg gaacctlatc gccctggcaa acctggtgtc catttgcgtg "cctggtgc 1020 
taoatacaga tgtgctgcct gctgagcgtg atgacttcat cctggggatt ctcaactgcg 1080 

sees sssuLt. ?t^ g ,t K «ctca. s s s=? 



1140 



S2S£ sssss skxs ssss i™ 

?ggagatftc aactctggct. gtgtaccgat tgccacaccc aggctggagg "ggagatgg 1260 
tggglctgct gtcgctgtgg gacatgaccc gcatgctgaa catgctcatc gtgttccgct 1320 
t?l?gcg?at latlccSag? atgaagccga tggccgtggt g^cagtacc Stcctgggcc 1380 
taatacaaaa catgcgtgct tttggcggga tcctggtggt ggtctactac gtatttgcca ±4«v 
tclt?ggga? caaS?g?tt agaggcgtca ttgtggctct tcctggaaac agcagcctgg 1500 
cccctgcSaa tggctcggcg ccctgtggga gcttcgagca QCtggagtac tgggccaaca 1560 
acttcgatga ctttgcggct gccctggtca ctctgtggaa cttgatggtg gtgaacaact ±b£J 
ggcagg?g?t tctggagca tatcggcgct actcaggccc gtggtccaag atctattttg 1680 
'tottataota gctagtgtcg tctgtcatct gggtcaacct gtttctggcc ctgattctgg 1740 
aoaa?t??c? ?caSagtgg gacocccgca gccacctgca gccccttgct gggaccccag 1800 
aggccaccta ccagatga?? gtggagctcc tgttcaggga tattctggag ggcccgggg I860 
aggatgagct cacagagagg ctgagccagc acccgcacct gtggctgtgc ^ggtgacgtc l|zu 
cgggc?gScg tcccagcagg ggcggcagga gagagaggct ^ctacaca ^tgcccatc 1980 
atggaagagg cggccatgct gtggccagcc aggcaggaag agacctttcc tctgacggac 2U4U 
cac?aagctg gggacaggaa ccaagtcctt tgcgtgtggc ccaacaacca tctacagaac 2100 
agctgc?ggt gcttcaggga ggcgccgtgc cctccgcttt cttttatagc tgcttcagtg 2160 
agaa?tSt Scgactcc acagggacct ttcagacaaa aatgcaagaa gcagcggcct 2220 
ccectgtccc SgcSgcttc cgtggtgcct ttgctgccgg cagcccttgg ggaccacagg 2280 
cctoaccagg gcctgcacag gttaaccgtc agacttccgg ggcattcagg tggggatgct 2340 
ggtSttSa fatggagagl accttgactg tgttttatta tttcatggct tgtatgagtg 2400 
tgactgggtg tgtttcttta gggttctgat tgc 



<210> 51 
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<211> 1772 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1553725CB1 



cga.ctg f . gcatttgtcc cjgjj-JJ t-Jgtt tjtgtg « 

Z= SSSS SS SS SSc^c „ s? t-ggg ij. 

. tgagcgggag ctggactggg ccaaggtgat S^ggagaag agcagga gg MJ 3Q() 
cccaggcacc caagtggagc agctgctgta tgccaagaag ctgtatga t 360 
ccccgacact ggggagaaga tgaatgtcat cgggcgcacg ^ tgatcttctg 420 

catgatcatc acgggcttca tgctccagtt <*acaggacg atgccggcgg | 48Q 
gcagtgggtg aaccagtcct tcaatgcctt agtcaactac accaacagga i***^ 540 
ccccacatca gtcaggcaga tggccctttc «accccaca 9 gccgctgggt 600 

cacggctgtg ggcatgaaca tgttgacaaa saaagcgccg ^ccttggtgg 9 * 660 
gccctttgcc gctgtggctg cggctaactg tgtcaatatc cccacgag attcccggag 720 
gctcataaag ggaatctgcg tgaaggacag ^atgaaaat jagattggtc attcccgg g ^ 
agctgcggcc ataggcatca cccaagtagt tatttctcgg atcaccatgt c g ^ ^ 
gatgatcttg ctgccagtca tcatggaaag gcttgagaaa ttgcacttca 9Q0 
caaggtcctg cacgccccat tgcaggtcat ^tgagcggg ^ atctggaacc 960 

gccagtggcg tgtgggcttt tcccacagaa atgtgaattg ccagtttcct 1Q20 
gaagctccaa gacactatca aggccaagta tggagaactt gagcc a tcaccagctc 1080 
Lagggtctc taaatgcccc acttcagcaa ^cagtct attcccatat ^ g^ ^ 
ctccttagct acgtgcacac ttgtgtcctc cttccccttt gcc act ggttgat 1200 
agggtagatt ggggggtggg acaatgaatg cctcatactt acaccccggt 1260 
tggacc?cag gggaaaaaag tgaaaaaggg tagcaaaggc caatgtcttc tagctg^ ^ 
ctcaacccct gtcccctgga gaccagaagc tgaggccctc tcagggagg | gttgaggaa 1380 
caaatcattt ggaaaagtta S^ccttt aggattctgg ttccagccag WJJ*^ ^ Q 
aagaccttgg atcaaaagga agcttctata cctctttccc cu « aaaqggaaag 1500 

agcaatggaa acttttaccc atgtaattct agctgaactc jggaaaaaga aggggg | ^ 

SSSS5 SSSS KS£ S=£S gt-jU gg-g S j 
&S 25S SKSS =S2 SSSS 

attttagtgg agacggggtt tcaccatgct gg 



<210> 52 

<211> 1874 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1695770CB1 



gctcccagc* ggcaggcta, ««^" SSSS So 
aga.gccagt agggttatct gtgtcaggat =»«tc««g f9»*"«"° clcttttgcg 180 
caggtaaag. taggccagag aagaagaggc Maagaggag tatcccclct 240 

•gcagcccag gttggagaaa S9«=**»= ""J?«=c gcagaalgac aactcttgtt 300 

sr ss ssss . s s Uj- a 
ssesss ssss sssss * i S bsss as 

Ettaaaggcc cacccgtgaa ogtgacctgc aacatcttca ^aacagttt c g 9^ ^ 
accaagacca caatggacta ccgggtgaat fcttcttgc g^acagtg g « 660 
cgcctgtcct accgagaata tcctgatgac tctctggacc tcflatccctc 720 

SS SSS2S SSSi 5*-** cagcatcagg 780 
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ctgaccctca ttttgtcctg cctgatggac ctcaagaact tccccatgga catccagacg 840 
tgcacgatgc agcttgagag ctttggctac accatgaaag J^t* 96Q 
gaagatgctc ctgctgtcca agtggctgag SOTctgactc eg 9 cacc tgcatc 1020 
gatgagaagg atctaggctg ttgtaccaag ^tacaaca ^aggg tacatcccc 10 80 

gaggtaaagt ttcacctgga aeggcagatg ^tactatc tgatccag g tcct 1140 
agcctactca tcgtcatcct gtcctgggtc tccttctgga tcaacatgg g | 12Q0 
gcccgtgtgg gectgggcat caccaccgtg ^caccatga ccacccag g 1260 
cgggcctctt tgcctaaggt gtcctacgtg aa 99° aa ^? tSattttgt Itctcgtcag 1320 
c?gctctttg tgttcgctgc ettgetggag tatgetgeca taaattttgt t ^ 138Q 
cataaagaat tcatacgact tcgaagaagg "gaggegee aacg gg * a 1440 

atccaagaaa gtcgtttcta tttccgtggc tatggcttgg g^actgect 9 ™ 15QQ 
gatggaggtc caatggaagg ttctggcatt tatagtcccc ^tccag ^ 1550 

agggaaggag aaaccacgcg gaaactctac ^ggactgag ccaagag g ^ ^ 

tcccgggctg tcttcccttt cactttcctc ^cttcaata ^ttctac g W^ fca ^ 1680 

ssss SK ssss ilSr~tr=^= ss 

.ffiSSSS SKS SS5S r««*t xeeo 

ccttaaaaaa aaaa 



<210> 53 

<211> 6211 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> irtis cofeature 

<223> Incyte ID No: 4672222CB1 



ssss SS 3SSS SEES 2. 

tggctacagg ccgcggagga ggaggaggtg ffagttgctgc ^"ceggag J 300 
gaggagaatg tcccagaaat cctggataga aagcactttg J-aagaggg 360 
tattatacca agttccaagg accctcacag a ^cttcca gg « ct gcaagtct 420 

actegtcagg tgtttttgtg f cgcttggt caagcaacat J^tgtttt ^g ^ 
tgccatgaaa tactcagatg tgaaattggg ^gaccattcc aa w taaattttca 540 . 
g?ctgtggaa aagcatacag aacagageco a«c«ataat £ca?Xgaca ccaaacctga 600 
agggggttct cattcctaca ^gctaagta tgtgaggcta ^9 ^ 660 

agtcattctg caacttctgc ttaaagaatg geaaaeggag ^ tgcttggaaa 720 

tgtacatggg ggcatgeaga aatttgagct ^acccacga Jtcaagcagt tg ?80 
aggtcttatt aaagctgcag "acaactgg ^"tggatt "aactgg g g | ^ MQ 
aggtgtggca aaacatgttg sagatgeect jaaagaacat gctt g atcttgttgg 90 0 
gatttgeact ateggaatag ctccatgggg agtga£tgaa aac g g tgtttt 960 
gagagatgtg gttgetcett Jtcjjacctt a "gaacccc ctgag agtatggggc 1020 

gaataatctg cattcccatt tcatattggt OTatgacggc a aa ttcatgetag 1080 
ggaagtcaga ctgagaagag aacttgmaaa ^tattaat gg ? gggc caa atgttatcct 1140 
gattggccag ggtgtccctg tggtggcact tatatccgag gg ««» gtgaagga ac 1200 
Sacagttctt gaataccttc aggaaagece ccctgttcca ^agttgcgc 126Q 
aggcagagct geagatctge tagegtatat tcataaacaa acagugug J c «J 1320 
tectgatgea gcagagcccg a tattatttc jactatcaaa a aggagcttat 1380 

gaatgaagca cttcatttat ttcaaacact jatggagtge «gg tacttactgc 1440 
cactgttttc catattgggt cagatgaaca J^aagatata ^SStScat tggcatggga 1500 
actgltaaaa ggtactaatg catctgeatt Jgaccagctt a ^tacat gg^g". 1560 
tagagttgac attgecaaaa a tcatgtatt tgtttatgga ^agcagtgg taaaacttct 1620 
cttggaacaa getatgettg atgctcttgt aatggataga gecg g aagaacttta 1680 
tattgaaaat ggagtaagca tgcataaatt ^cttaccatt ccgag gg ca 1740 

caacactaaa caaggtccaa ctaatccaat ^gtttcat ^ttgttcgag ^ a 1800 

SSSS SSSSS SSSS c£«c S »» 
SSSSS SSSSSS SSSS S5SES — • 
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ccatttcatt aagacagcac agccctaccg accaaagatt gatacagtta tggaagaagg 2040 
aaaaaagaaa agaaccaaag atgaaattgt agacattgat gatccagaaa ccaagcgctt 2100 
SSSSSa c?taatgaac ttttaatttg ggcttgcctt atgaagaggc aggtcatggc 2160 
ccatttttta tggcaacatg gtgaagaatc aatggctaaa gcattagttg cctgtaagat iiiv 
ctatcgttca Sggcatatg aagcaaagca gagtgacctg gtagatgata cttcagaaga 2280 
actaaaacag ta??ccaatS attttggtca gttggccgtt gaattattag aacagtcctt 2340 
cagacaaga? gaaaccatgg ctatgaaatt gctcacttat gaactgaaga actggagtaa 2400 
t?caacSgc cttaagttaj cagtttcttc aagacttaga ccttttgtag ctcacacctg 2460 
tacacaaatg ttgttatctg atatgtggat gggaaggctg aatatgagga aaaattcctg 2520 
gtacaaggtc atactaagca ttttagttcc acctgccata ttgctgttag agtataaaac «bu 
?aaggc?gaa atgtcccata tcccacaatc tcaagatgct catcagatga caatggatga 2640 
cagcgaaaac aa?tttcaga acataacaga agagatcccc atggaagtgt "aaagaagt 2700 
acggattttg gatagtaatg . aaggaaagaa tgagatggag atacaaatga aatcaaaaaa 2760 
octtccaatt acgcgaaagt tttatgcctt ttatcatgca ccaattgtaa aattctggtt 2820 
?aacacg?tg gcatatttag gatttctgat gctttataca tttgtggttc ttgtacaaat 2880 
ggaacagtta ccttcagtt? lagaatggat tgttattgct tatattttta cttatgccat 2940 
?gagaaagtc cgtgagatct ttatgtctga agctgggaaa gtaaaccaga agattaaagt 3000 
a?ggtttagt gattacttca acatcagtga tacaattgcc ataatttctt tcttcattgg 3060 
atttggacta agatttggag caaaatggaa ctttgcaaat gcatatgata atcatgtttt 3120 
StgSgga agattaattt actgtcttaa cataatattt tggtatgtgc gtttgctaga 3180 
tttSSSS gtaaatcaac aggcaggacc ttatgtaatg atgattggaa aaatggtggc 3240 
caatatgttc tacattgtag tgattatggc tcttgtatta cttagttttg Otattooag 3300 
aaaggcaata ctttatcctc atgaagcacc atcttggact cttgctaaag atatagtttt 3360 
XSSSSSk tggatgattt ttggtgaagt ttatgcatac g-attgatg gtgtgcaaa 3420 
tgattctgtt atccctcaaa tctgtggtcc tgggacgtgg ttgactccat ttcttcaagc 3480 
agtctacStc tttgtacagt atatcattat ggttaatctt cttattgcat "ttcaacaa 3540 
tgtgtattta caagtgaagg caatttccaa tattgtatgg aagtaccagc 9"atcattt 3600 
tattatggct tatcatgaga aaccagttct gcctcctcca cttatcattc "agccatat 3660 
agtttctctg ttttgctgca tatgtaagag aagaaagaaa gataagactt ccgatggacc 3720 
aaaacttttc ttaacagaag aagatcaaaa gaaacttcat gattttgaag agcagtgtgt 3780 
?gaaatgtat ttcaatgaaa aagatgacaa atttcattct gggagtgaag agagaattcg 3840 
tgtcactttt gaaagagtgg aacagatgtg cattcagatt aaagaagttg gagatcgtgt 3900 
caactacata aaaagatcat tacaatcatt agattctcaa attggccatt tgcaagatct 3960 
ttcagccctg acggtagata cattaaaaac actcactgcc cagaaagcgt cggaagctag 4020 
caaagttca? aagaaatca cacgagaact gagcatttcc aaacacttgg ctcaaaacct 40 80 
tattgatgat ggtcctgtaa gaccttctgt atggaaaaag catggtgttg taaatacact 4140 
tagctcctct cttcctcaag gtgatcttga aagtaataat ccttttcatt gtaatatttt 4200 
aa?gaaagat gacaaagatc cccagtgtaa tatatttggt caagacttac ctgcagtacc 4260 
ccagagaaaa gaatttaatt ttccagaggc tggttcctct tctggtgcct tattcccaag 4320 
tgc?g?ttcc cctccagaac tgcgacagag actacatggg gtagaactct taaaaatatt 4380 
taataaaaat caaaaattag gcagttcatc tactagcata ccacatctgt catccccacc 4440 
aaccaaattt tttgttagta caccatctca gccaagttgc aaaagccact tggaaactgg 4500 
aaccaaagat caagaaactg tttgctctaa agctacagaa ggagataata cagaatttgg 4560 
agca??5ta ggaLcagag atagcatgga tttacagagg tttaaagaaa oatcaaacaa 4620 
gataaaaata ctatccaata acaatacttc tgaaaacact ttgaaacgag tgagttctct 4680 
ScSgattt actgactgtc acagaacttc cattcctgtt cattcaaaac aagaaaaaat 4740 
cagtagaagg ccatctaccg aagacactca tgaagtagat tccaaagcag <=tttaatacc 4800 
ggattggtta caagatagac catcaaacag agaaatgcca tctgaagaag gaacattaaa 4860 
?ggtc?Sct tetScattta agccagctat ggatacaaat tactattatt cagctgtgga 4920 
aagaaataac ttgatgaggt tatcacagag cattccattt acacctgtgc ctccaagagg 4980 
ggagcctgtc acagtgtatc gtttggaaga gagttcaccc aacatactaa ataacagcat 5040 
g?c?tcttgg tcacaactag gcctctgtgc caaaatagag tttttaagca aagaggagat 5100 
gggaggagg? ttacgaagag ctgtcaaagt acagtgtacc tggtcagaac atgatatcct 5160 
caaalcaggg catctttata ttatcaaatc ttttcttcca gaggtggtta atacatggtc 5220 
LgtattS? aaagaagata cagttctgca tctctgtctg agagaaattc aacaacagag 5280 
agLgcacaa aagcttacgt ttgcctttaa tcaaatgaaa cccaaatcca taccatattc 5340 
tccaaggttc cttgaagttt tcctgctgta ttgccattca gcaggacagt gfttgctgt 5400 
ggaagaatgt atgactggag aatttagaaa atacaacaat aataatggag atgagattat 5460 
?ccaactaat ac?ctggaag agatcatgct agcctttagc cactggactt acgaatatac 5520 
aagaggggag ttactggtac ttgatttgca aggtgttggt gaaaatttga ctgacccatc 5580 
?g?ga?aaal gcagaagaaa agagatcctg tgatatggtt tttggcccag caaatctagg 5640 
agaagatgca attaaaaact tcagagcaaa acatcactgt aattcttgct gtagaaagct 5700 
taaacttcca gatctgaaga ggaatgatta tacgcctgat aaaattatat ttcctcagga 5760 
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t gas ccttc. g .t ttg «a t c t t =«c=t„ a r ttcc,cc aaagatca, a™a. 5820 

2SS5S SSS5S SSSSS |S , - = 
SSSS USES SKS S ™« gs 

££££ SSSS SSSSS SSSS — ?— . 

taggtgtcca tgcagatttt attggcagct g 

<210> 54 

<2U> 3714 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 6176128CB1 

ft™, ccaagctgcc gcgctcgccg t = ca aggcgggccc ggg™ 60 q 
ccagccggcg ccgcagcccc cgaggagcct cacgggctca 9«=|«gcc ? tt ggcctg 180 
ggcgggggct ccgtgggcag cgacgtgggc cagaggcttc ctgtagaaga tttjj g ^ 
gactcctccc tgtctcaggt ccaggtggag "«acgtca J^aga tgttcaac 3 00 

cggctcaagc tgttcttcat caaaaaccaa agatcgagtc tmptoq J ga ? gacccg 36O 
ttctccctga agctgctcac ctgcctgctc tacattgtgc S^ctgct t gac?cgtcc 420 
gccctgggca tcggatggtg gggctgccca ^Qtccttcaa eg | 

lecgagatea actgggctcc tattctgtgg Stggagagaa jgatgacact gtggg g 
caggtcatcg tggccataat aagcttcctg ^gaegatge ttctcatcta cctcg^ ^ 
aaaggcaaca tctgggagca gatcttccgc Q^gtccttcg tecegg g J cccgtcttt 6 60 
ctgcccttca tcatcacgat cttctggccg ccgctgcgga acctgttcat cc 3 ?2Q 
ctgaactget ggctggccaa scacgcgctg ^aaacatga "aatgactt ?80 
atectgegga cacagtcagc catgttcaac ^9^cctca tcctct g c 840 

tgcctcgttt teaeggggae ctgcggcatc cagcacctgg ^gggyyy etaeggtgae 900 
tScctcctga cctccttcta cttctgcatc Stcaccttct ^acegj^gg ^aegg S 960 
gtcacgccca agatctggee ategcagctg ctggtggtca ^atgatctg eg gg ^ 
atggtgctcc cactgeagtt cgaggagctc gtctacctct as 3 ^^ u Stotatcaac 1080 
ggggg?aact acagccgcca ccgtgcgcag aeggagaage ^cgtggtcct ^gtgtcagc 1U« 
?SScaaga tcgaccttct catggacttc ctgaacgagt tctacgccca «cccggctc 1 
caggactatt acgtggtcat cctgtgcccc aeggagatgg atgtccaggt 1260 
ctgcagatcc ctctgtggtc ccagcgggtc atctacctcc agggctctgc a « 320 

ssss s= sssss s ss 1 Hssl - 

52SSS SSS35S SSS SSSS SK-« g&S S3 
=252= =S= SSSS J » g ii ~ SS 

gaggtgtacc acatccgcat gggtgacagc aagttcttcc S^gtacg ggg y a 
Itcacctacg cggccttcca cgcccacaag aagtatggcg tgtgcctcat oggg g g 
egggaggaca acaagagcat cctgctgaac ccggggcccc gcacatcct ^ 1860 

SSSS —gfg SSSS tj acgaggg IgO 

SSSS £535 SSSSS f S SSSS 28 

gagaaegget cgggcagccg gcggcccagc atcgcgcccg tcctggaact ggcega g 
tcagccctgc tgccctgcga cctgctgagc S*ccagtcgg ^atgaggt gaege g g 
gacgacgagg ggctctccgt ggtagagtat ^gaagggct accctcccaa ^tege 
ateggcaget ccccaaccct gtgccacctc ctgcctgtga aagccccctt g g ^ 
eggctggaca agggctgeaa gcacaacagc tatgaagacg ^aaggect ««« 2400 
aacaaj?tga tcatcgtctc ggcagagacg geeggcaatg ^ctgtacaa ettcategtg ^ 

ssss ssss ssss =gs Sss SSSS S! 

??gS C t gg SSSS g g=S SSSS SS~* £A~~ 26,0 
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aagaccatcg tcaacgtgca gaccatgttc cggctcttcc ccagcctcag catcaccacg 2700 
gagctcaccc acccttccaa catgcgcttc atgcagttcc ^gccaagga 2820 
ctggctcttt ccaaactaga aaagagggag gagagaatg ^ccaa JJcactgctc 2880 
ttccgcctgc cgttcgccgc cggccgcgtc ttcagcatca ^cgccgg 2940 
taccagtcct tcgtgaagga ctacatgatc accatcaccc ^tgctgct 9^ ^ 30Q0 
accacgccgg gctcggggta cctctgtgcc atgaaaatca ccgaggg g ^ 3060 
cgcacgtacg gccgcctctt ecagaagctc tgctcctcca gcgccgag^ ^ 3120 

SSSS 3SSE SSSSS S "ESS SSS SS 
SSSS SSSS S5SSS S=S SS55 £™ »S 

gcgLSaagc aggcaggccg ggcggcggcc 0cgg»*m ^agccagca gcgcc g 
. ctgtaccggc gctctgagcg ccaggagctc tccgagctgg gaagaaccg cacg g ^ 
ctggggctgc ccaccaccgg ctacgaggac gtagcaaatt taacagccag 9 a 
aa?cgggtaa acctgggata tttgcaagac gagatgaacg accaccagaa caccctctcc 

SS 555 5S5 IS ss =r~ ss 



<210> 55 

<211> 3115 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 7473418CB1 

<400> 55 



Sss sssss sssssrs ssssss Si iglii as 
SSSS ss = S 1 ~ ssss ss 

cagtacatga aggacaccaa catgctgttc ^tgggcggcc * gggggccaag. 360 

SSSSS SSSSSS sssss ssss t ss^ J- 

ssss ssss sssss SSK sss= SS "SSSS s 

ctgccagcta acagcgctgt gcccaccaca SMjgteaag 660 
ctggggoagc aggaagaoca agagcggaag »ss«gtgta caaSglggtg 720 

SSSS SSSSS SSSS SSS5 -S Saa 7J. 

SSSSS SSSSSS SSS. SSS. SSSSS IgaSaSaag N. 
SSS. SSSSS caggtgctg caggaggagt accggaagct ggggccc tg ,« 
tccttogcgg agatcaacgt gctgiW SSSS SSaaaact 1080 

SSSSSSSSS SSSSSSS SSSSS SSSS S.|aS t g™c s , 
SSSSS SSSSS SSSSS SSSSS SSSS S ace neo 
SSSSS SSSSS SSSSS SSSSS = a SW jjj, 

SSSSSS SSSS SSSSS SSSS SSSS Saaacagg. ,00 

SSSS SSSSSSSS SSSS tg-,-. 

SSSSSSSS SSSSSSSS SSSSSSS SS SS S g ajScaacgat gtccacacac JJJ. 
caccaaaafc cagccaatgg gccacctctt SSSSSSSSS SSSSccatS I860 

SSSSSSS SSSSSSS SSSSSSSS ssssss SSSS ~ «. 
SSSSS ssssss SSSSS SSSSSSSSS ssssss sss. 
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t,cca g «cc =at 3 tc==« ccasctc.cc ttjjgj- tc„atcatc tctatcactc ,100 
tggaagggac accccagcca gggacggaat g c ™££" 9«9 ggccaccacc 2220 

ag?gcgagtg ggaatcagag cctcctgaag cctctgggaa ctcctcctgt 2280 
aaaggatgag gaatctgagt tgccaacttc aggacgacac ctggc g ctg 234 0 

caccacaggc caacctacgc ccttcatcac "ggttctgc zzz J ggt caacatt 2400 
tcccacctct ccagtgagcc tccttcaact "ttggtccc ctgugc g gg tcatct 2460 
tgccgagacg ccttggctgg caccctctgg ttctcccagg JW* 2520 

tttctgggag atgcttcccc tg^catcccc ^^tag 9 ctccctcac ctgaggccca 2580 
cagtgatggc gctctggggg ^acagtggg ctatctaggc °ct catgccagcg 2640 

gagtggacac agctgttaat ttccactggc tatgccactt cag g t 2700 

tttgagctcc tctgggtaaa atcttccctt tgttgactgg cc » aC aacccctg 2760 

gacaacagag gatcgttgag attgagcagc gcttggtgat gcaacaactc 2 820 

. cccgtgggcc aatctacttg aagttactcg ^caaagacc ^aaagtggg g c 288Q 

cagagaggct gtgggaatct tcagaagccc "^gtaaga gac g g ^ gca 2948 
SS22S SSS5 Tgga g SSgi tgjjg atgagagcct 3000 
SSSK 2SSS — ,115 



<210> 56 

<211> 2846 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 7474129CB1 



t^agccat ccctctgcct gcaatgagag cttcccgccg cctcagccac agtcccaccc 60 q 
gggggccttg ggccccagac atgcggtgat ^tcagggcaa gsgtgccac 3 180 
Scctcacca gccatgaaag cccaccccaa mptggtg ^tcatgg agatcacccc 240 
tgctgccccc agtgggaacc ctgccgtcct gccagagaag jggccggcgg | 300 
cacaaagaag agcatctctg fftaaotgtga tgacatggac tccccccagt 360 
tgatgtgaca gagaccccat ccaatcccaa ^agccccagt gca g gg tg 420 

gcagaggagg aaaaagaggc ggctgaagaa ^g^cttt gc g g a ggcgccat ga 480 
cgtggaggag ttggtagagt tgctggtgga £"tgcagg ggaagacctg 540 

tgaggatgtg cctgacttcc tcatgcacaa ^oggcc tccgacacgg f 6QQ 
cctgatgaag gccttgttaa acatcaaccc ^aacaccaag acaca gagga 660 

tgcctttgct gaagagaacg acatcctggg ca ^tcatc aa^gg catcgc 7 20 

ggcctatgaa gggcagacgg cgctgaacat °£^e£* S?tcttcaa 780 

agccctgctc atcgccgccg gcgccgacgt ^aacgcgcac gcca ggggg 840 
ccccaagtac caacacgaag gcttctactt ^tgagacg cccccrocc^ a = atca cctc 900 
caccaaccag cccgagattg tgcagctgct SJtwagcac J^g^J aggacttcaa 960 
gcgggactca cgaggcaaca acatccttca cgccctggtg accgcggc g s* 1020 
gacgcagaat gacgttgtga agcgcatgta cgacatgatc ctactgcgga ^ ^ ^ 
ggagctggag accactcgca acaacgatgg cctcacgccg 9^ agcggctccg 1140 
gggcaaggcg gagatcctga agtacatcct ^agtcgtgag a gg a t 1200 

gagcctgtcc aggaagttca ccgactgggc g^cggaccc ^ tctacaaoac 1260 

cc?caccaac gtggacacca ccacggacaa ctcagtgctg S£££^ tgctgcatat 1320 
caacatcgac aaccggcatg agatgctgac ^ctggagccg ctgcaca g * ^ 1380 

gaagtggaag aagtttgcca agcacatgtt ctttctgtcc «ccg^ CC atcccgca 1440 
caacatcacc ctgaccctcg tctcgtacta "gcccccgg ??agggagga tgtttgtgct 1500 
ccccttggcc ctgacgcaca agatggggtg g^agctc ct ^f^ | 1560 
catctgggcc atgtgcatct ctgtgaaaga OT»ttacc atcttcctgc ^ 
ggatctgcag tccatcctct cggatgcctg g"£cactcc J™J tcgcctgcct 1680 

gcttgtgata ctgtctgtct tcttgtactt gtttgcctac SSSSgg gStccagtc 1740 
cgtgctggcc atggccctgg gctgggcgaa "tgctctac uuogogn | gttctt i800 
catgggcatg tacagcgtca tgatccagaa ggtcattttg ^gatgttc eg 9 60 

S5SSS-S22S 5SSS S« H S sfe »» 
SESSS SSSS! S22S5 SSSS S-S— 



80/85 



PCT/US01/24217 

WO 02/12340 

catgctcatt gctctgatgg gcgagactgt ggagaacgtc tccaaggaga ^gaacgcat 2100 
ctggcgcctg cagagagcca ggaccatctt S^gtttgag ^aaatgttac ^agaatgg 
gaggagcaga ttccggatgg gagagctgtg caaagtggcc ^tgatt tccgactg g 
?t?gcggatc aatgaggtga agtggactga atggaagacg cacgtctcct "a 
agacccgggg cctgtaagac gaacagattt caacaaaatc caagattctt ^cagg 
cagcaaaacc actctcaatg catttgaaga agtcgaggaa "cccggaaa 9 
gaagcggaac ccagagctgg tgtgcgcgtg cgctgtctgg ^tgcaggc ggagc s 

Ict?tg?gca gagaggcttt gagggatgat agJ^SS-SSSSt gSa?ttgt 2580 
gtgcaccctc gtgctcagtg ctcagtgggt gtctgaactg ^gggscagcu gc * 2640 

ctgagtggga aacatcctgg attttgttac ttggcaaaca ^tggtgtaa ^cctacag 

n SSS S53 S Ssss sss s 

cattgtacca tctcttctgc tgtgac 



<210> 57 

<211> 906 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> iaisc__feature 

<223> Incyte ID No: 7481414CB1 



§3 EiIS Ess Ei§ |il IP I 

S Ss S|| s§ is SSI 5 

gcgctggctc tctgccccac tgagcttgtg aa ^gtcggc taagagtatc 480 

aagatgtcag ggaagatagc acaaagctat ^cacaattt g^ggc ^ 54Q 

££££ S5SS 3Sg£ Sjg-t ss ... 

SSSS SS2ES 2E2S ££££ — ?S 

tctatgtttg ggaagcctgc aggattaatc ga»acc"ta "agtgttgt gaga g 

SSSS SS3SES 3SSSS SSSSS 552-* gg™. 



tactga 

<210> 58 

<211> 1840 

<212> DNA 

<213> Homo sapiens 

<220> 

<22i> iuisc_feature 

<223> Incyte ID No: 7481461CB1 



g^cgcct gcgcgctggc cgcctgcgcg ctgccagccc gcccgcccgc -ggggctcc 60 
Iccgccctcg cctcggcctc gttagcccgc caggagcccc g^gctcctc W 
ctggtaactc gcgtccctcg cgcttctccg »cgcctgagg W««gcrtc Wjcaw 

gcKtcccag gaggagccgg actccgcgcg SSSS ccgagglggc 300 

Igcgcccacg ggggccgctc cgccgcccgg cccgggaccc tcggacagcc ccgagg gg ^ q 

tgtcgagaag gtggaggtgg agctggcggg sccggcgacc ^ggagcccc ^gag ^ 

cgaacccccc gagggcggct ggggctggct OTt^g ^catgt Kgaaacc?? 480 

SE22 SS5= SEES KS2S | fe- «; 

SSSSS SSSS sssss S3SS2 ssss *s«s~ 
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aagttccatc gagcctctgt accttaccta tggaatcata tttgcctgcg gctgctcctt 720 
tgcataccag ccttcattgg tcattttggg acactatttc aag g g taag 840 

gaatggcatt gtcactgctg ^agcagtgt =ttcacaatc ctgc g t * ttcatgtt 90 0 
ggttctgatt gacagcgtgg gcctctttta c acattgagg Iccaatacca aagataaaga 960 
Sttctcttt ctggctggct ttacttaccg acctcttgct accagtacca aaga^j^ ^ 
gagtggaggt agcggatcct ccctcttttc ^aggaaaaag ttcagtc gaa taccact 1080 
tttcaatttt gccatcttca aggtgacagc "atgcagtg ^cagttg | n40 
tgcacttttt ggatactttg tgccttatgt tcacttgatg -acatgtaa J^*^ 
tcaaaatgaa aaaaataaag aggttgtccu catyty^u ^rj a ^ aa( , ft *-t-t-at-rtaca 1260 

SSSSS 3SEE 5S5SS 5SKK £S Sgfc «. 
85=2 2S3S ?| US -SSS S8 

tctgctcgga ttcatgtcta tacccatgac tgtggccca J ^ cttattgg 156 0 

tgacaaactg ggctcctatg atgtggcatt ctacctcgcc sgag agatC agtaa 1620 

22KS S32S £ i bM SSS SS 
25SS5 SSSS " Sttttat a ™ ca 

■ aatttcatat ttttttaatc acatcctagg aatagcacaa 



<210> 59 

<211> 5348 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 7472541CB1 



gj^ga ggg* ££££ SS=S «££ 120 

3SS55 SSSS £223 a ~t tgctggggct tcctttgaca 180 
agagaatcag aaccgactgg tgacatttgt "caatgaaa scaacagtgt 3 3 300 
.gcaatttatt ctgatcaaca atttgcccag ctaaagtact acatctgccc c « 6Q 

?ggctgtagg ggcacagcaa aggtcactgg tctaacctcc "^gggac tccgcc 
ga?cttcgcc tgctgctgga aatggccctc ^agtggact catcgtggca tcgg ggj ^ 
tggagagtca gagatggctt cccccattgt ccatcggaaa ^cacaccgcc 3 540 
gagaaaggga gacagagcta caacttgaca «£SK5'S££SS SSSc-Ja 600 
atattccatc aagattggga agaggtctcc aggagatacc ctggcaac g tagatgg 660 
accaaataca ccctcttcac cttcctgccc cggaatctct "gagcaatt tcatag gg ^ 
gctaacctct atttcctgtt cctggtgatt "gagctgga tgccctccat gga g ^ 
cacagagaaa tcaccatgtt accattggcc attgtcctgt tcgtcatcat g 33 84Q 
ggcatggagg acttcaagag acaccgcttt 3ataaagcaa taaactg 900 
atttatgaaa gaaaagagca gacctatgtg ^agaagtgcc gga gg a * cttttttcc 9 60 
gacttcatcc aaatgaaatg caatgagatt £°«agcag acatactcct c 1Q20 
tctgacccca atgggatatg ccatctggaa actgccagct tggatggaga J 1Q80 
aagcaaagac gtgtcgtgaa gggcttctca cagcaggagg ^acagttcga accag g 
ttlcacaata ccatcgtgtg tgagaaaccc aacaaccacc tcaacaaatt ^aaggg 
atggagcatc ctgaccagac caggactggc tttggctgtg agagtcttct g gg^ ^ 
tgcaccatca gaaacaccga gatggctgtt gfcatugtca « gcggcgcatg 1320 

aaagccatgc tgaacaacag tggcccccgg tacaaacgca scaagattga gcgg g a 
aatatagaca tcttcttctg cattgggatc ctcatcctca tgtgccttat ^ 
ggtcacagca tctggaatgg gacctttgaa gaacaccctc cccwgag aatgatcatc 1500 
aatggcagct tccttcccag tgcccttggg tgttcctcac aatgat ^ 

ctgctccagg tgctgatccc catctctttg tatgtctcca ccgag gg » ttatccatt 1620 
caagtgttct tcttgagcaa tgaccttgac ctgtatgatg aagagac^ cttctccgat 
caatgtcgag ccctcaacat cgcagaggac "sggccag « catgggcagc 1740 

SS5S SS5SS | S sfe SS 

SS2SS ESS SS5S SSSK 
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cccatccagg 
gtggccttca 
gttcgagatg 
ctctccacca 
atggtgtcca 
gaccattgtc 
aaaagtacct 
attccacttt 
acgtccctgg 
tcattctcat 
gccaccacag 
gatgacggtg 
ggcacttcct 
tgttacgagg 
ttcacactag 
ctcaccttca 
gttgtgaggc 
atcatggacc 
agaaaaatcc 
acactatgca 
cggcgtgagg 
catctggaga 
ggagttccag 
ggagataagc 
gacactgttt 
gcattggaag 
ttccgcttac 
ttggtcatcg 
ctggaattga 
agtatgatag 
ggagcaaatg 
gaaggcatgc 
aagttgctgc 
tacctctaca 
ttctccagct 
tccttgcctc 
gcattgcctg 
tggatttcta 
gcctataagg 
ctcaccacaa 
gtcgtgctcc 
tgcgtcatct 
cccactttct 
ttcctgtctc 
aaactccccc 
gcccctgtcc 
gacttcagtg 
tcagtgctcc 
gactcctcag 
tactcaggag 
cagagttcac 
tagtcactgg 
tcttattctt 
tgacacaaag 
cttgcaaaca 
gaccaaagcc 
ttccaagatt 
tttattta 



gccactaccg 
gcagctccat 
ctgccctgtg 
cctcctccat 
caaccaccga 
aatgcttaga 
tcatattccg 
ccttctttgg 
agaagattca 
ccactgcacc 
actcggatga 
gctacaggag 
tggaggaggc 
ctgagagccc 
tgtcccggac 
gcctcctctg 
acccactgac 
tgctggaaga 
gagcccggac 
ttgccaagaa 
ctgaggcatc 
atcaactcac 
atacgattgc 
aggagacagc 
ataccatcaa 
agctaaagca 
cttccaagac 
atgggaagac 
cccagtattg 
tcaagctggt 
atgtaagcat 
aggctgtcat 
tcgtgcatgg 
agaacgtgtg 
ccaccatgat 
ctcttgtctt 
agctatacaa 
tggtggatgc 
gctctgatat 
tccttttgca 
tcggcagctt 
gcaacagccc 
acctcgtctg 
tgcaaggaac 
cagacaaaag 
ccgaagtggc 
ccagcacccc 
acgaacagag 
ctcaactctc 
gacagactga 
tgaccatatg 
tccacataac 
ttcttcaaca 



acgggaaacc 
gtttggagaa 
attcttctga 
caagaagtgt 



gcaaaggtct 
agaaaaagat 
gttggagacc 
tgctgatttc 
gcccaggcag 
atttcagggc 
cataagacaa 
ccacaaggtc 
gcagctcttc 
ctctgacaca 
gagagatgat 
cagcatgtgg 
attggaggcc 
tgatgaggcc 
acctgagcag 
caccctgggc 
tggcgagatt 
cccagcctgc 
ccaaaagcat 
ggttgtaagc 
cctcgacaac 
cttacttgga 
cactctgcgg 
ggtcaacatt 
tacagagaat 
atttcgtgaa 
accatccatc 
attgaatgcc 
tcggtccgtc 
gcgagacaag 
gattcaagct 
gtccagcgac 
ccactggtgt 
ctacgtcaac 
tgattactgg 
tggagtcctt 
gagtggccag 
attctaccag 
agatgtcttt 
ccaggcaatg 
cctgatgtac 
caccaatccc 
ctttctcaca 
ttgtgggaag 
aaacctggaa 
tcgaccaact 
aaagagctct 
atgtggcacg 
atccggggag 
tatgtgccgg 
aggagctgca 
ccttgacccc 
accttgactt 
cagtcggcct 
ggaggctgtt 
ccctctagat 
atcaacccag 



atggggcacc 
gtaactccag 
ttgtcagaca 
ttccttgcct 
aggtgggatg 
tggaggaaaa 
cttggtatta 
accatcaaac 
cagaagttga 
gacctcgggg 
gcatctgtgt 
gaccagggcg 
. ccagccacag 
gccctggtgc 
gtgactgtgc 
tttgactctg 
gttgtctaca 
gtacctgaca 
ctagacttgt 
- gaagaggact 
cgagatgagc 
gccactggga 
gaggctggga 
gcccattcct 
caggagacct 
ctacagaagc 
acctcagaag 
atcttccagg 
ctgtgctgcc 
ttgcgcgtca 
gctgatattg 
tttgccatca 
tactcgcgcc 
ctgctcttct 
cagatgatat 
gacaaagaca 
aactctgagt 
agcctcatct 
acctttggga 
gaaatgaaga 
tttctggtat 
tattgggtga 
ccagttgttg 
tctctaatct 
atccagagtt 
caccacccag 
aaccctccca 
gagtgcatga 
cacctgctgg 
tgctcaaaga 
gaaatctgta 
ttcttcttca 
ccatggagga 
ctagttttct 
tttgttgaat 
aagcgtagcc 
gcaatatctc 



gtgaaagctc 
ataaaaacct 
gcagacctgc 
taaccatctg 
atcaaaagat 
tatctggctt 
tttccaacat 
cctcaagcaa 
agctattgag 
agagcttagg 
gcagtggagg 
acatcctgga 
acctggccag 
acgctgccca 
gcctgcccca 
tcaggaagag 
ccaagggtgc 
ttaatatgga 
atgcaagaga 
tccggagatg 
ttctcatgga 
tcgaagaccg 
tccagctctg 
gcagactgtt 
gtgaatccat 
cagaccgcaa 
ctgtggttcc 
gaaagctaga 
gctccacgcc 
tgaccctttc 
gaattggaat 
cccgctttaa 
tggccaggat 
ggtatcagtt 
tcttcaatct 
tctctgcaga 
gctataacct 
gtttctttat 
caccaatcaa 
catggaccat 
ccctcctgta 
tggaaggcca 
ctcttctccc 
caaaagctca 
ggagaagcag 
tgtcatctat 
agaggaagca 
gggatgactc 
gacctaacag 
ggagcagcca 
caaactcaac 
tagaggaaac 
agtgctggcc 
gctgctctca 
cgagttctca 
tacaacccag 
aggatatgga 



acagcctcct 
actgaccaag 
caaggcttcc 
caactctgtc 
agtggaaaat 
cacttattgc 
tgagagtaat 
ggctctgggg 
cctcagccag 
ggccaacgtg 
tgactccact 
gtctgggtca 
gcctgagttc 
tgcctacagc 
gggcacctgc 
aatgtctgtg 
tgactcggtc 
aaagaagctg 
tggcctgcgc 
ggccagtttc 
aactgcacag 
gctgcaggaa 
ggtcctgact 
aaatcagacc 
cctcaattgt 
gctctttgga 
agaagctgga 
gaagaagttt 
actccagaag 
cataggtgat 
atctggacag 
gcatctcaag 
ggtggtgtac 
cttctgtggt 
cttctttacc 
aacactcctg 
gtcgactttc 
cccttacctg 
caccatctcc 
tttccacgga 
caatgccacc 
gctctcaaac 
aagatacttt 
gaaaattgac 
acagaggcct 
cacaggacag 
tgtggaagag 
atgctcaggg 
gataatggcc 
tcgccgatcc 
agaggccacc 
aatgtgccag 
ccaaggggtc 
ggcagcacat 
aatcggttta 
tgccgtaagt 
agtttctggg 



1980 
2Q40 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5348 



<210> 60 
<211> 5149 
<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> incyte ID No: 6999183CB1 



atgagcaaga gacgcatgag cgtgggtcag caaacatggg ctcttctctg -agaactgt 60 

™ K2SJ SSSK S2K f •» 
™? 2252 22SS 222! SS2S =gg 300 

atggcaggaa gaacaatcat ggggtggcct gatgaaaaaa gcatggatga attggatttg Jbo 
aac?at?caa Lgacgcagt gagagtcatc tttactgata ccttctccta ^catttgaag 420 
ttttcttggg. gacatagaat ccccatgatg aaagagcaca gagaccattc agctcactgc 
2gcag?ga atgaaaSaat gaagtgtgaa ggttcagagt tctgggagaa aggctttgta 540 

SSSS SS SSSS =S 2 g W ,oo 
SS5S5 SSK2 SSSSS S2SSS S S | £ 
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?tgaaaggtg tggtgtttga "tatatgaa ^-agatca ctgccctcct tgjtcacagt ^ 

S52S 2222 22K2 L ? a . ™ a? .so 
2SSS 2£=S 2222 22SS — i ijoo 

cta??gga?g aaccgactgc tggattggat cctctttcaa ^accgaat atggaatctc 1860 
ctgaaagagg ggaaatcaga cagagtaatt ctcttcagca c ^ a 9"tat 1980 
gacattctgg cggacaggaa ggtgttcata tccaatggga agctgaagtg tgcaggctct i*» 
tctctgttS ttaagaagaa atggggcata ggctaccatt taagtttgca tctgaatgaa 2040 
aggtg?gatc cagagagtat aacatcactg |ttaagcagc -atctctga tgccaaattg 2100 

£252 £222 £222 S2SS ttjggtt 2220 

?rrataacaa ctttgaatga ggtgtttctg aaattagaag gaaaatcaac tattgatgaa ^bu 
22222 gaat??gg?g acaattacaa actgatgggg caaaagatat aggaagcctt 2340 
ccagatacuy y rrt-rt-i-ccttc cacaaaacaa ggaaaacaat cagtggcgtg 24UU 

S S 5S5S S2SS 22=2 12 
2222 222! 2S2S 2=22 

aaXagacag ggtcaacca? tgataacttt ttacattcac tgaggcgaca ^acatagct 27 

2232 2222 2225 222: 22S2 -2 c ago 

22lgS ttfctgtcct cctggatgtc attagcaatg gactacttgg aatttttaat 2880 
tcgtcaglac acattcagac tgacagaagc acattttttg aagagcatat gattatgag 2940 
tatoqgtacc gaagtaacac cttcttctgg ataccgatgg cagcctcttt ca ctccatac juuu 
2222 gcagcattgg tgactacaaa aaaaaagctc attcccagct acggatttca 3060 
2222 ct?ctgca?a clggtttggc caagcactgg tggatgtttc cctgtacttt 3120 
??gatcctcc tgctaatgca aataatggat tatattttta gcccagagga jattatattt 3180 
ataattcaaa acctgttaat tcaaatcctg tgtagtattg gctatgtctc atctcttgnc 
ttcttgacat atgtgatttc attcattttt cgcaatggga gaaaaaatag twcatttgg 3300 
tcatSttct tc?taattgt ggtcatcttc tcgatagttg ctactgatct aaatgaatat 3360 
ggatttctag ggctattttt tggcaccatg ttaatacctc ccttcacatt gattggctct 3420 
Sattcattt tttctgagat ttctcctgat tccatggatt acttaggagc ttcagaatct 
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acacccgtca tcattgccag ctgtctacgg aaggaatatg caggcaaaaa gaaaaattgc 3780 
ttttctaaaa ggaagaaaac aa?tgccaca agaaatgtct ctttttgtgt taaaaaaggt 3840 
omSSSS gactgttagg acacaatgga gctggtaaaa gtacaactat taagatgata 3900 
SS32 caaaaccaa? tgcaggacag gtgattttga aagggagcgg tggaggggaa 3960 
cccSgggct tcctggggta ctgccctcag gagaatgcgc tgtggcccaa cctgacagtg 4020 
aaacaqcacc tggaggtgta cgctgccgtg aaaggtctca ggaaagggga cgcaatgatc 4080 
acScacac ggltagtgga tgcg?tcaag ctgcaggacc agctgaaggc tcccgtgaag 4140 
acc?tg?cag Igggaatafa gcglaagctg tgctttgtgc tgagcatcct HOT-cc* 4200 
tcaataatac ttctggatga gccgtcgacc gggatggacc ccgaggggca gcagcaaatg 4260 
toSaggtga ttcgggcclc cttlagaaac acggagaggg gcgccctcct gaccacccac 4320 
tSggcag aggSgaggc ggtgtgtgac cgagtggcca tcatggtgtc aggaaggctg 4380 
agaSSttg g??ccatcca acacctgaaa agcaaatttg gcaaagacta cctgctggag 4440 
afgaagctgl agaacctggc acaaatggag cccctccatg cagagatcct ^ggcttttc 4500 
cScaggctg ctcagcagga aaggttctcc tccctgatgg tctataagtt gcctgttgag 4560 

SSSS 2S££ =SSS ™- S 
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S522S SSSK SSSS SSSSS 2SSS 

a?aaaqgcga tcatagggca taagcctaag ccataccatg cagcctttgt gccagcaacc 4980 
aatSIItS SSStgt gttaagttta aaaatgcatt tattatagaa ttgtctaoat 5040 
SSaggft gtcatggaga atgcttaatt ttcttcctct ^acttcaaa atattaaata 5100 
ttttcttatt tttttgatta aagtataaat taagacaccc tattgactt 
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